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Sir: 

I. REAL PARTY IN INTEREST 

The real party in interest of the present application is Ryogen LLC . Ryogen LLC is 
the owner of the present application by way of an assignment from the inventor, James W. 
Ryan of all rights, title, and interest. 

II. RELATED APPEALS AND INTERFERENCES 

There are no appeals or interferences related to the present application. 

III. STATUS OF CLAIMS 

Claims 7, 10, 15-18, 20, 24, 25, 30 and 31 stand finally rejected by the Examiner as 
noted in the Advisory Action mailed October 29, 2007. Claims 1-6, 8-9, 11, 13, 19, 21 and 
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26-29 have been canceled. Claims 12, 14, 22, 23 and 32-38 have been withdrawn. The 
rejection of claims 7, 10, 15-18, 20, 24, 25, 30 and 31 are appealed. 

IV. STATUS OF AMENDMENTS 

No amendments have been submitted subsequent to the Examiner's Final Rejection. 

V. SUMMARY OF THE CLAIMED SUBJECT MATTER 



A. Independent Claim 7 



Claim Elements 


Support in 
specification 


An isolated nucleic acid molecule 20-5 1039 contiguous 
nucleotides in length consisting of a reverse or forward strand of a 
region of SEQ ID NO:4 


Page 9, line 34; 
page 10, lines 22- 
26 


wherein said region is selected from the group consisting of a 5'- 
non coding region depicted in nucleotides 51039-41739 of SEQ 
ID NO:4, a 3'-non-coding region depicted in nucleotides 9503-1 
of SEQ ID NO:4, a contiguous intron-exon region between 
nucleotides 41738-9502 of SEQ ID NO:4, wherein a sequence 
segment comprising 41738-9502 of SEQ ID NO:4 encodes human 
mouse double minute 2 homolog depicted in SEQ ID NO:2, a 
contiguous exon-intron region between nucleotide 41738-9502 of 
SEQ ID NO:4, wherein a sequence segment comprising 41738- 
9502 of SEQ ID NO:4 encodes human mouse double minute 2 
homolog depicted in SEQ ID NO:2, an intron depicted in 
nucteotiaes JdJoj-4Ud4j, JoJUy-J3iz/, 3Zyy4-Zyolo, zyj>o4- 
25577, 25507-25384, 25287-21 169, 21006-141 10, 13953-13267, 
and/or 13188-10665, a region comprising a dinucleotide of the 
following group: 41739-41738, 40645-40646, 36309-36310, 
36384-36385, 32994-32995, 33126-33127, 29564-29565, 29615- 
29616, 25507-25508, 25287-25288, 25383-25384, 25576-25577, 
21006-21007,21168-21169, 14109-14110, 13953-13954, 13266- 
13267, 13188-13189, 10664-10665 and/or 9504-9503 


Page 10: Table 2 


a transcription binding site selected from the group consisting of 

BINDING SITES huMDM2, location in SEQ ID NO:4 

AP1_C: 36-46, 2876-2886; 

AP4_Q5: 7944-7980; 

AP4_Q6: 7943-59, 8924-8940, 9294-9310; 


Page 9, line 29 to 
page 10, line 2; 
Table 3 on pages 
10-12 
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ARNTOl: 

BRN2_01: 

CAAT_01: 

CDPCR3HD_01: 

CEBPB_01: 
4801,6855-6869; 

CRELOl: 

DELTAEF1_01: 

FREAC7_01: 

GATA1_04: 

GATA1_05: 

GATA2_02: 

GATA2_03: 
7393-7405: 

GATA3_02: 

GATA3_03: 
7041-7053,7589-7601; 



1682-1706, 2193-2217, 9201-9225; 
1040-1058,7803-7821; 
3292-3306; 
6522-6540; 

1424-1438,3917-3931,4178-4192,4787- 

5630-5642; 
83-95,6328-6340; 
2757-2773, 5154-5170, 5823-5839; 
4846-4858,7017-7029; 
8464-8476; 

6045-6057, 6073-6085, 6142-6154; 
2489-2501, 3323-3335, 3384-3396, 

3264-3276, 6870-6882: 
40-52, 5729-5741, 6529-6541, 6874-6886, 



GATA_C: 7 
HFH201: 



HFH3_01: 
9418,9418-9434; 



HFH8_01: 
IK2_01: 



MZF1_01: 
5432; 



349-7361,8188-8200; 

1743-1759,7995-8011; 

502-518, 1739-1755, 4160-4176, 9402- 

8184-8200; 

951-963,3588-3600; 

1202-1210, 1447-1455, 4997-4005, 5424- 



3 



NF1_Q6: 1480-1500, 8166-8182; 




NFAT_Q6: 4190-4208, 6009-6027; 




NKX25_01: 741-755, 1648-1662, 1885-1899, 1984- 
1998, 3609-3623, 4928-4942, 5060-5074, 5889-5903, 8850-8864, 
9190-9204; 




NKX25_02: 2584-2599, 2970-2984, 4644-4658, 5179- 
5193,6482-6496; 




NMYC_01: 2560-2572; 




RORA1_01: 220-238,2638-2656; 




S8_01: 4644-4656, 4842-4854, 4845-4857, 5200- 
5212, 5371-5383, 5735-5747, 6482-6494, 6541-6553, 6544-6556, 
6772-6784, 7270-7292, 7273-7285; 




SOX5_01: 1355-1371, 1430-1446,3094-3110,3155- 
3171, 4669-4685, 4692-4708, 4789-4805; 




SRY_02: 4164-4180,5665-5681; 




TATA 01: 1261-1277,2574-2590,2723-2739,2733- 
2749, 2770-2786, 4199-4215, 4206-4222; 




TATA_C: 5900-5916, 7456-7472, 7702-7718, 7917- 
7933; and 




XFD2_01: 7702-7218,7917-7933; 




a transcription binding site selected from the group consisting of 

BINDING SITES huMDM2, location in SEQ ID NO:4 
AP1_C: 12109-12119, 12695-12705, 22600-22610, 
24166-24176, 31311-31321, 35234-35244, 39184-39194; 




AP1_Q2: 11952-11962, 12068-12078, 14798-14808, 
21748-21758, 22613-22623, 23676-23686, 26562-26572, 30046- 
30056; 




API Q4: 12695-12705,31311-31321,35234-35244, 
36295-36305, 38784-38794, 39188-39198; 




AP4_Q6: 31635-31651; 
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BRN2_01: 
40027-40045; 



CDPCR3HD_01: 
29344-29362; 

CEBPB_01: 

CREL01: 

DELTAEF1_01: 



13448-13466, 14764-14782, 28094-28112, 

11288-11302,15054-15068; 
11286-11304, 13284-13302, 20846-20864, 

29241-29255; 

36091-36103,38873-38885; 
18083-18095, 20385-20397, 26955-26967; 



FREAC7_01: 11982-11998, 15187-15202, 16523-16539, 

16529-16545, 16587-16603, 16604-16620, 16676-16642, 16633- 
16649, 16644-16660, 16650-16666, 16657-16673, 16673-16689, 
16762-16778, 21332-21348, 25689-25700, 26529-26545, 27767- 
27783,29495-29511; 

GATA1_02: 10916-10928, 15775-15789, 18162-18174, 

26088-26100,32518-32530; 



GATA1_03: 



28012-28024; 



GATA1_04: 11153-11165, 11630-11642, 13778-13790, 

17439-17451, 19300-19312, 21606-21618, 22743-22755, 23747- 
23759, 25806-25818, 26529-26541, 29424-29436, 30455-30467, 
32761-32778, 33352-33364, 33960-33972, 36101-36113, 40007- 
40019; 



GATA1_05: 



GATA1_06: 
37855-37867; 



11590-11602, 26550-26562, 36737-36749; 
18772-18784, 23054-23066, 35568-35580, 



GATA2_02: 20755-20767, 30830-30842, 34755-34767, 

36285-36297, 39143-39155, 39641-39653, 40586-40598; 

GATA2_03: 13535-13547,22711-22723,23161-23173, 
25028-25040, 27237-27249, 36277-36289; 

GATA3_02: 11558-11570, 16470-16482, 17225-17237, 

19619- 19631, 22156-22168, 22443-22455, 24713-24725, 27619- 
27631, 32716-32728, 34124-34136, 34163-34175, 36832-36844, 
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38403-38415; 




GATA3_03: 10869-10881, 11515-11527, 13845-13857, 
17221-17233, 18952-18964, 20050-20062, 40171-40183; 




GATA_C: 15848-15860, 18899-18911,23640-23652, 
29072-29084, 30881-30893, 33198-33210, 37472-37484, 38621- 
38633; 




GFI1_01: 35469-35481, 35492-35504; 




HFH2_01: 15939-15955, 24636-24652, 25866-25882, 
32171-32187, 35372-35388, 39457-35473; 




HFH3_01: 13340-13356, 19218-19234,21328-21344, 
21336-21352, 21344-21360, 28062-28078, 32125-32141; 




HFH8_01: 14133-14149, 22578-22584; 




HNF3B_01: 13150-13166, 16505-16521, 25264-25280, 
29443-29459, 37654-37670; 




IK2_01: 11547-11559, 17144-17156, 18961-18973, 
23883-23895, 27617-27629, 28908-28920, 29241-29253, 30752- 
30764, 34768-34780; 




LYF1_01: 12319-12331, 19191-19203, 37226-37238, 
39430-39442; 




MAX_01: 22974-22986,33339-33351; 




MZF1_01: 26105-26113, 35187-35195; 




NF1_Q6: 12048-12064,33334-33354; 




NFAT_Q6: 13295-13313, 14157-14175, 14311-14329, 
14414-14432, 18269-18287. 19326-19344,20801-20819,21177- 
21195, 22537-22555, 23861-23879, 25392-25410, 25879-25897, 
27524-27542, 30636-30654, 30718-30736, 31525-31543, 33655- 
33673, 34726-34744, 34917-34535, 34990-35008, 35979-35997, 
36479-36493, 36577-36595, 37154-37172, 40224-40242, 40365- 
40383; 




NKX25 01: 12041-12055, 12340-12354, 12471-12485, 12742- 
12756, 12877-12891, 13849-13863, 18995-19009, 21440-21454, 
21883-21897, 28426-28440, 30964-30978, 32033-32047, 32265- 
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32279; 

NKX25_02: 10998-11012, 12711-12725, 14131-14145, 14726- 
14740, 16024-16038; 

NMYC 01: 18753-18765, 18754-18766, 23076-23088, 30534- 
30546, 34400-34412; 

RORA1_01: 13134-13152, 22966-22984, 24934-24952, 33341- 
33359, 34760-34778; 

S8_01: 11000-11012, 11977-11989, 12048-12060, 12051- 

12063, 13747-13759, 13923-13935, 13926-13938, 14676-14688, 
14679-14691, 16026-16038, 16313-16325, 16316-16328, 17515- 
17527, 20756-20768, 20759-20771, 23154-23166, 23157-23169, 
25198-25210, 25201-25213, 26651-26663, 27508-27520, 27511- 
27523, 29450-29462, 29478-28490, 29775-29787, 29778-29790, 
29813-29825, 29816-29828, 31329-31341, 31677-31689, 31680- 
31692, 31732-31744, 31735-31747, 36137-36149, 36140-36152, 
36812-36824, 36815-36827, 37413-37425, 38679-38691, 39474- 
39486, 39477-39489; 

SOX5_01: 27397-27413, 27572-27588, 28100-28116, 29230- 
29246, 29439-29455, 30690-30706, 31595-31611, 33871-33887, 
34113-34129, 34624-34640, 37668-37684, 38582-38598, 39124- 
39140, 40410-40426; 

SRY_02: 20016-20032, 22410-22426, 27329-27345, 29162- 
29178, 29499-29515, 30646-30662, 31503-31519, 35928-35944, 
37324-37340; 

TATA_01: 32722-32738, 32729-32745, 32807-32823, 33825- 
33841, 34120-34136, 35433-35449, 36593-36609; 

TATA_C: 11015-11031, 11817-11833, 13635-13651, 14930- 
14946; 

TCF11_01: 18543-18549, 22574-22580, 31281-31297, 31489- 
31505,38754-38770; 

USFOl: 23075-23087,32577-32589; 

VMYB_02: 11526-11538, 17384-17396, 18400-18412, 19549- 
19561, 22188-22200, 40486-40508 and 

XFD2_01: 16620-16636. 18153-18169, 22102-22118, 23141- 
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And a transcription binding site selected from the group consisting 
of 

BINDING SITES huMDM2 location in SEQ ID NO:4 
AP1_C: 44584-44594, 49069-49079: 

42174-42184, 45217-45227, 48422-48422, 



AP1_Q2: 
50447-50457; 



AP1_Q4: 



42702-42712, 50806-50816; 



AP4_Q6: 42117-42133,42118-42134, 
42244-42260, 45432-45448; 45433-45449, 
46609-46625; 

BRN2_01: 42310-42328, 44022-44040, 47514-47532, 

48900-48918,48967-48985; 

CAAT_01: 44866-44880; 

CDPCR3HD_01: 45671-45689, 49219-49237; 

CREL_01 : 42437-42449, 49797-49809; 

FREAC7_01: 47026-47042, 47292-47308, 47658-47674; 

GATA1_02: 43482-43494, 48926-48938, 49284-49296; 

GATA1_03: 47371-47383; 

GATA1_04: 43054-43066, 43162-43162, 43967-43979, 
45464-45476, 45916-45928, 47763-47775; 

GATA1_05: 49319-49331, 49459-49471; 

GATA1_06: 47590-47602; 

GATA2_02: 42660-42672, 43475-43487; 

GATA2_03: 43714-43726, 50948-50960; 

GATA3_02: 49155-49167, 49844-49856; 



GATA3_03: 42202-42214, 44810-44822, 

48438-48450, 49136-49148, 49337-49349, 
49869-49881; 

GATA_C: 4401 1-44023, 45256-45268, 

45823-45835, 47915-47927, 49201-49213, 
49573-49585; 



GFI1_01: 


46606-46618, 47063-47075; 


HFH3_01: 


47030-47046, 47284-47300, 47288-47304; 


IK2_01: 


45275-45287; 


LYF1_01: 


44564-44576, 46991-47003, 49567-49579; 


MAX_01: 


43234-43246, 48726-48738; 


MZF1_01: 


41772-41780, 42290-42298, 42295-42303, 



44507-44515,45105-45113,45203-45211,49948-49956, 
50774-50782; 

NF1_Q6: 50209-50229; 

NFAT_Q6: 42061-42079, 44418-44436, 46399-46417, 

47974-47992, 49267-49285, 49964-49982, 50392-50410; 

NKX25_01: 42394-42408, 43507-43521, 46115-46129; 

RORA1_01: 45073-45091, 48718-48736; 

S8_01: 43552-43564, 45214-45226, 47160-47172, 

48419-48431, 49295-49307, 50379-50391; 

SOX5_01: 43716-43732, 46351-46367, 47156-47172, 

47774-47790, 47868-47884, 47974-47990, 48915-48931, 50323- 
50339; 

TATA_01: 45588-45604, 47625-47641, 48026-48042, 

48659-48675, 49056-49072, 49079-49095, 49152-49168; 

TCF11_01: 49115-49131; 

VMYB_02: 42010-42022, 42279-42291, 44651-44663; 
XFD2_01: 42870-42886, 42910-42926. 
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B. Independent Claim 24 



Claim Element 


Support in Specification 


An isolated nucleic acid molecule 20-5000 contiguous 
nucleotides in length 

consisting of a reverse or forward strand of a contiguous exon 
intron region between nucleotides 41738-9502 of SEQ ID 
NO:4 or contiguous intron-exon region between nucleotides 
41738-9502 of SEQ ID NO:4, 

wherein a sequence segment comprising 41738-9502 of SEQ 
ID NO:4 encodes human mouse double minute 2 homolog 
depicted in SEQ ID NO:2 


Page 9, line 35 to page 10, 
line 2 

Page 10, Table 2 and page 
14, lines 29-32 

Page 2, lines 7-13; page 10, 
Table 2 



VI. Grounds of Rejection to Be Reviewed on Appeal 

Whether claim 7, 10, 15-18, 20, 24, 25, 30 and 31 are unpatentable over Muzny et al. 
("Muzny") in view of Vogelstein et al. ("Vogelstein"). 



VI. Argument 

In the final rejection dated April 16, 2007, the Office Action stated: 

It would have been obvious to one of ordinary skill in the art at 
the time the invention was made to use said cDNA to identify 
the genomic DNA that encodes the human MDM2 homolog of 
SEQ ID NO:2 on chromosome 12ql2-14. The motivation is 
provided by Vogelstein et al. who teach that it binds to 
oncogene p53 and is diagnostic of tumorigenesis. The state of 
the art provides various techniques for obtaining genomic DNA 
using cDNA probes that are usually labeled. The comparison of 
genomic and cDNA would result in the identification of regions 
comprising exon-intron and intron-exon junctions within coding 

and non-coding regions. One of ordinary skill in the art would 
have been motivated to use said non-coding regions or 
fragments thereof of at least 20 nucleotides and up to 5000 or 
51039 nucleotides (the entire length of SEQ ID NO:4) 
nucleotides for detecting splice variants of chromosome 12ql2- 
14 from genomic nucleotide samples from an individual, for 
example. As a matter of convenience a non-coding region such 
as an exon-intron or intron-exon region or fragments thereof 
can be present in a kit or on a solid support. Further, said 
support can be a microarray according to a customary use of 
nucleic acid molecules in the art. 
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Appellant respectfully traverses the rejection. In Appellant's view, it would not have 
been obvious to combine the disclosure of Muzny and Vogelstein given that there was no 
suggestion to do so. Muzny merely contains just a small portion of chromosome 12 DNA. 
Chromosome 12 is about 130 million base pairs long and is believed to contain several 
hundred genes (by analysis after 2001 and after the Applicant discovered the human MDM2 
homologue gene). Muzny et al knew that clone AC025423 (from 1V1 1-61 102) was from 
chromosome 12 but there is no evidence in the NCBI report of a sub-assignment to the p- or 
q-arm. Further, there is no evidence that Muzny knew whether the clone did or did not 
contain one or more genes and particularly whether it contained the gene encoded by SEQ ID 
NO:4. As will be discussed in further detail below, the MDM2 cDNA constitutes just 1.6% 
of the clone disclosed by Muzny. Undue experimentation would have been required not only 
to locate the MDM2 gene but also identify exon-intron junctions. 

Even assuming arguendo that there had been such a motivation, one of ordinary skill 
in the art would not have obtained the isolated nucleic acid molecules of the present 
invention, the specifically recited non-coding regions of the MDM2 gene located between 
nucleotides 20-51039 of SEQ ID NO:4. This is because, as noted in previous responses, 
Vogelstein placed the human MDM2 homologue gene at 12ql2-14. Actually, this finding is 
incorrect. After the publication of Vogelstein, the gene was found not to be located at 12ql2- 
14. The gene is actually several millions of base pairs away at 12ql5 (see, for example, 
Andersen et al., 1996, Mammalian Genome 7:780-783, Bureau, 1995, Genomics 28: 109-112 
and Genecard, attached hereto as Exhibit 1 and previously made of record). Thus, even if 
these two references were indeed combined, the ordinary skilled artisan would have looked 
for the MDM2 gene in the wrong location and thus would not have obtained the claimed 
sequences. 

In response to Remarks made by Appellant in the amendment submitted on January 

23, 2007, the final Office Action states 

With regard to the 103(a) rejection, Applicant argues that 
"There was certainly no indication given in the cited art either 
singly or in combination regarding the location of the MDM2 
gene encoding human mouse double minute 2 homolog 
depicted in SEQ ID NO:2 on AC025423" (Remarks, page 13). 
As was explained in the previous Office action mailed August 
25, 2006 "the exact location of the gene is not necessary as long 
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as its sequence is known as in the instant case" (page 5). At the 
time the invention was made finding non-coding regions using 
cDNA and genomic DNA was standard technique. Watson et al 
(1992) "Recombinant DNA" teach that once the first genes 
were cloned, introns were identified by comparing the cloned 
genomic DNA with the corresponding cloned cDNA (page 137, 
2nd column). In the case of the instant application, both 
genomic and cDNA were known. They only needed to be 
compared in order to identify intron-exon junctions. Applicant 
further argues that "There is no prior art that defines the 
complete genomic structure of a particular gene. This is 
necessary in order to identify the claimed noncoding sequences 
in the instant invention" (page 13). This argument is similar to 
the issue of location and is responded above. Applicants further 
argues that "One of ordinary skill in the art would have no idea 
as to the number of introns and the length of the 5' and 3' 
noncoding sequences in the MDM2 gene" (page 14). It is 
agreed that said number and length were not known before the 
invention. If they were known, the rejection would be 102. The 
current rejection is 103(a) stating that it would have been 
obvious to compare genomic DNA and cDNA and identify the 
number of introns and the length of the 5' and 3' noncoding 
sequences in the MDM2 gene. Applicant further argues "The 
Examiner's assertion that one of ordinary skill in the art would 
have expected that the location is often imprecise actually 
further supports Applicant's assertion the claimed sequences 
were indeed nonobvious. If the location is imprecise, where 
would one of ordinary skill in the art know where to look?" 
(page 15). This is not persuasive because the precise location is 
not necessary when the 2 sequences that need to be compared 
are known. They would be obvious because the genomic DNA 
was already sequenced and cDNA was made. Applicant further 
argues that "It should be noted that annotation of the human 
genomic DNA was still relatively new as of the priority date of 
the instant application. Even assuming arguendo that finding 
noncoding regions using cDNA and genomic DNA was 
standard technique, the means to make the invention does not 
predict the claimed invention. Specifically the means used to 
make the invention do not predict the claimed nucleic acid 
molecules. BLASTN, TBLASTN, etc. do not themselves 
predict gene-specific results. It is Applicant's view that only 
general guidance is provided. This is not sufficient" (page 16). 
This is not persuasive because there is no need to predict the 
sequence itself, it was known before. The invention is in the 
identification of the specific fragments (exons/introns) of the 
known sequence. Applicant appears to argue as if the genomic 
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DNA was not sequenced prior to the instant invention. 
Applicant further argues that "it is Applicant's view that given 
that the cDNA constitutes just 1.6% of the AC025423 sequence 
is in itself evidence of the unpredictability of determining the 
entire sequence of the MDM2 gene and thus contiguous intron- 
exon and exon-intron regions. The Examiner is in effect 
asserting that just because Applicant did isolate the claimed 
nucleic acid molecule, it must have been obvious to do so. It is 
well established case law that the fact that the inventors were 
ultimately successful is irrelevant to whether one of ordinary 
skill in the art at the time the invention was made would have 
reasonably expected success" (paragraph bridging pages 16-17). 
This is not persuasive because the size of the cDNA does not 
matter. ESTs of smaller size are used for comparison to 
genomic sequences. Applicant does not show what unexpected 
difficulties other than routine comparison of the genomic DNA 
and cDNA were encountered during the time the invention was 
made. 

It appears that the Examiner is asserting that just because a genomic clone containing 
chromosome 12 sequences had been isolated, the MDM2 cDNA was known and that 
techniques were known for finding noncoding regions, one of ordinary skill in the art would 
have had a reasonable expectation of success of obtaining the claimed sequences. Further, the 
Examiner is asserting that it is not significant that cDNA just constitutes just 1 .6% of the 
AC025423 sequence. Appellant disagrees for a number of reasons. 

First, Appellant asserts that the Watson disclosure is of limited relevance here. 

Specifically, Watson specifically states on page 137 

It should be noted that at the time that the electron microscopy 
experiments on adenovirus were done, no one had clone a 
cellular gene yet. Once the first genes were cloned, introns 
were identified by comparing the cloned genomic DNA with the 
corresponding cloned cDNA. 

This is very different from the situation with respect to the isolated non-coding fragments of 

the instant invention. Muzny merely discloses the sequence of a genomic clone containing 

chromosome 12 sequences, not isolated SEQ ID NO:4. No indication is provided is provided 

in the Muzny disclosure as to which portion of chromosome 12 has actually been sequenced. 

Vogelstein merely discloses the MDM2 cDNA. Applicant was the first to identify SEQ ID 

NO:4 and determine that it did indeed encode MDM2 and in particular determine the claimed 

noncoding sequences. 
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Second, Appellant asserts that there would not be a reasonable expectation of success 
of obtaining the claimed noncoding sequences of SEQ ID NO:4 in view of the cited 
references. Vogelstein placed the human MDM2 homologue gene at 12ql2-14. As noted 
above, there was actually a previous disclosure stating that the MDM2 was located between 
12ql4.3-15 (see, for example, Andersen et al., 1996, Mammalian Genome 7:780-783 and 
Bureau, 1995, Genomics 28: 109-112, submitted and disclosed in previous response attached 
hereto as Exhibit 1). However, given the conflicting locations published as of the priority 
date of the instant application, one of ordinary skill in the art would not have known which 
location was actually correct. This situation in Appellant's view would constitute undue 
experimentation. 

Third, Appellant disagrees with the assertion that the size of the cDNA does not matter 
since ESTs of smaller size are used for comparison to genomic sequences. In Appellant's 
view, the size of the cDNA relative to the isolated genomic clone is of particular significance. 
If, for example, the cDNA constituted 50% of a particular genomic clone, considerable less 
experimentation would be involved in determining the sequence of a particular gene than 
when it is merely 1.6% of the genomic clone. 



B. Advisory Action 

The Advisory Action states: 

Applicant reiterates the previous arguments that have been 
answered in the Final Office Action mailed 4/16/07. As 
correctly noted by Applicant, the examiner asserts that if 
genomic and cDNA sequences are known, one of ordinary skill 
in the art would have a reasonable expectation of success of 
obtaining the claimed sequences (Remarks, page 15). Applicant 
does not explain why the fact that cDNA constitutes 1 .6% of the 
genomic sequence leads to no expectation of success. 
Applicant further discusses KSR v Teleflex case, stating that 
"under KSR, an approach that is obvious to try is also obvious 
where normal trial and error procedures will lead to the result" 
(page 16). It appears that KSR supports the 103(a) rejection 
because Applicant does not show why the result wouldn't be 
obtained. 

In response, it has throughout prosecution been asserted that the fact that the MDM2 
cDNA constitutes just 1 .6% of the genomic sequence disclosed by Muzny is very significant 
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since in Appellant's view, undue experimentation would be involved to isolate the claimed 
sequence. More than trial and error would be involved. 

Muzny only disclosed the sequence of the clone AC025423. There was no indication 
at the time of the filing date of the instant application that AC025423 actually contained SEQ 
ID NO:4, more specifically a sequence encoding human MDM2. Muzny did not sequence all 
of chromosome 12; they only sequenced the chromosome 12 BAC clone AC025423, less than 
0.12% of the chromosome (about 157 kB vs. about 132,000 Kb). The clone's sub-site on 
chromosome 12 apparently was not then known. Muzny did not even suggest, must less state 
whether the AC025423 did or did not contain any gene. The determination of a sequence of 
genomic DNA does not necessarily imply the presence of a gene. This is because it is well 
known in the art and was certainly well known when the instant invention was made and as of 
the filing date of the instant application that most genomic DNA contains "junk" DNA, not 
genes. Appellant attaches hereto as Exhibit 2 the following review article (Wong et al., 2000, 
"Is Junk DNA Mostly Intron DNA?" Genome Research 10: 1672-1678) which discuss "junk" 
DNA in further detail. As is evident from this review article, junk DNA was well known as 
of the priority date of the above-referenced application. When obtaining genomic clone 
sequences, one of skill in the art would not have any way of knowing whether or not it 
actually contains a gene(s) or just junk DNA. Even noting that a clone has a high GC content 
is a poor guide to a sequence's likely gene content, especially given the presence of 
pseudogenes. Therefore, contra to assertions made in the Office Action, disclosure of this 
particular sequence would not indicate to the skilled artisan that this clone would necessarily 
contain a gene. 

The clone AC025423 is 150,579 nucleotides in length. The cDNA sequence only 
contains 2372 nucleotides (1.6% of AC025423). However, one of ordinary skill in the art 
would not know where or how these 2372 nucleotides are interspersed within the AC025423 
clone or if indeed it is even present. No teachings are provided as to the structure of the 
MDM2 gene itself: number and size of exons, number and size of introns, locations of exons 
and introns and number and size of 5' and 3' untranslated regions. The possibilities are close 
to infinite. Thus there would not be a reasonable expectation of success of obtaining the 
claimed sequences given that the isolation and identification of the claimed sequences 
constitutes undue experimentation. 
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It is Appellant's view that the combination of the two references would still constitute 



"obvious to try" even under KSR v Teleflex. KSR v Teleflex, 82 USPQ2d 1385; 127 S.Ct. 

1727 (2007). Appellant notes that the "Examination Guidelines for Determining Obviousness 

Under 35 U.S.C. 103 in View of the Supreme Court Decision in KSR International Co. v. 

Teleflex", 72 FR 57526 (October 10, 2007) (hereinafter "Examination Guidelines") stated 

with respect to "obvious to try" the following: 

To reject a claim based on this rationale, Office personnel must 
resolve the Graham factual inquiries. Office personnel must 
then articulate the following: 

(1) a finding that at the time of the invention, there had 
been a recognized problem or need in the art, which may 
include a design need or market pressure to solve a problem; 

(2) a finding that there had been a finite number of 
identified, predictable potential solutions to the recognized need 
or problem; 

(3) a finding that one of ordinary skill in the art could 
have pursued the known potential solutions with a reasonable 
expectation of success; and 

(4) whatever additional findings based on the Graham 
factual inquiries may be necessary, in view of the facts of the 
case under consideration, to explain a conclusion of 
obviousness. 

Ex parte Kubin, 83 USPQ2d 1410, 2007 WL 2070495 (Bd. App. & Int. 2007) was 

cited as an example in the Examination Guidelines of a situation where a finite number of 

identified, predictable solutions are provided with a reasonable expectation of success. In 

Kubin, the claimed invention was directed to 

An isolated nucleic acid molecule comprising a polynucleotide 
encoding a polypeptide at least 80% identical to amino acids 
22-221 of SEQ ID NO:2, wherein the polypeptide binds CD48. 

Id at 1412. 

In the rejection affirmed by the Board, the Examiner asserted that 

The skilled artisan would have been motivated to isolate the 
nucleic acid sequence corresponding to NAIL, based on 
Valiante's disclosure of p38 (which is the same protein as 
NAIL) and Valiante's express teachings how to isolate p38 
cDNA by using conventional techniques, such as taught in 
Sambrook, including using mAbC1.7, a probe specific for p38. 
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Id at 1412. 



This finding of obviousness in Kubin appeared to be predicated on the obviousness of 

isolating NAIL cDNA, not any other nucleic acid sequences encoding the NAIL protein. The 

Board specifically states 

Based on our findings and those of the Examiner, at least one of 
Appellants' claimed polynucleotides would have been obvious 
to one of ordinary skill in the art at the time Appellants' 
invention was made 

Appellants argue the "cited references do not provide an 
adequate written description of the claimed nucleic acid 
sequences." (citation omitted). In so arguing, Appellants 
overlook the distinction between obviousness under §103 and 
lack of written description under § 1 12, § 1 . A single obvious 
species within a claimed genus renders the claimed genus 
unpatentable under §103. Thus, an obvious method of 
obtaining a single nucleic acid molecule encoding NAIL may be 
all that is required to show that the presently claimed genus of 
nucleic acid molecules is unpatentable under §103 

The "problem" facing those in the art was to isolate NAIL 
cDNA, and there were a limited number of methodologies 
available to do so. The skilled artisan would have had reason to 
try these methodologies with the reasonable expectation that at 
least one would be successful. 



The claimed invention is directed to a nucleic acid molecule 20-51039 contiguous 
nucleotides in length consisting of a reverse or forward strand of a region of SEQ ID NO:4. 
SEQ ID NO:4 is a genomic sequence, not cDNA encoding MDM2. This is very different from 
Kubin which involves deducing the NAIL cDNA sequence from NAIL polypeptide sequence 
and subsequently isolating this sequence. The art cited in the instant application was a cDNA 
sequence and a genomic clone containing chromosome 12 sequence. Considerably more and 
undue experimentation would be involved in identifying and isolating the genomic sequence 
and consequently the claimed regions comprising the recited noncoding regions of the MDM2 
gene than in identifying cDNA in view of a disclosure of a polypeptide sequence. The 
claimed sequence is between 20-51039 nucleotides in length. The Muzny sequence is 
150,579 nucleotides in length. No direction or frame of reference is provided as to where the 
MDM2 sequence could be located. Furthermore, although Vogelstein, disclosed the MDM2 
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cDNA, no direction is provided as to the location of exons, introns, 5' untranslated and/or 3' 

untranslated regions. The MDM2 cDNA was only 2372 nucleotides in length. Thus, there 

would have been a huge number of strategies and methodologies that one of ordinary skill in 

the art could have employed in attempting to obtain the claimed sequences. 

Appellant also notes that in the Examination Guidelines it is stated 

The key to supporting any rejection under 35 U.S.C. 103 is the 
clear articulation of the reason(s) why the claimed invention 
would have been obvious. The Supreme Court in KSR noted 
that the analysis supporting a rejection under 35 U.S.C. 103 
should be made explicit. The Court quoting In re Kahn stated 
that ""'[Rejections on obviousness cannot be sustained by 
mere conclusory statements; instead, there must be some 
articulated reasoning with some rational underpinning to 
support the legal conclusion of obviousness.'" 

This statement follows the holding of established case law, most notably In re Beasley 117 

Fed. Appx. 739, 2004 WL 2793170 (C.A. Fed 2004). In Beasley, the Court held that: 

The examiner and the Board have managed to find motivation 
for substituting one type of memory for another without 
providing a citation of any relevant, identifiable source of 
information justifying such substitution. The statements made 
by the Examiner, upon which the Board relied, amount to no 
more than conclusory statements of generalized advantages and 
convenient assumptions about skilled artisans. 

A similar situation exists with respect to the instant application. An example of such 

conclusory statements made are on page 4 of the Office Action and shown in boldface: 

It would have been obvious to one of ordinary skill in the art at 
the time the invention was made to use said cDNA to identify 
the genomic DNA that encodes the human MDM2 homolog of 
SEQ ID NO:2 on chromosome 12ql2-14. The motivation is 
provide by Vogelstein et al. who teach that it binds to oncogene 
p53 and is diagnostic of tumorigenesis. The state of the art 
provides various techniques for obtaining genomic DNA 
using cDNA probes that are usually labeled. The 
comparison of genomic and cDNA would result in the 
identification of regions comprising exon-intron and intron- 
exon junctions within coding and non-coding regions. 



The Examiner has only referred to Watson, 1992. As previously argued, Watson 
would not apply in this case since in Watson, the genes themselves were actually cloned. No 
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other references were provided by the Examiner in response to Appellant's subsequent 
arguments. Furthermore, there is nothing stated in the Office Action as to how the state of the 
art teaches how given the teaching of a large genomic clone and the cDNA sequence of a 
particular gene, one of ordinary skill in the art could with particularity identify specific exon 
and intron sequences of a particular gene and assemble it in its entirety. There is no prior art 
that defines the complete genomic structure of a particular gene. This is necessary in order to 
accurately identify the claimed noncoding sequences in the instant invention. 

C. Secondary Considerations 

Appellant also asserts that secondary considerations should also apply when 

determining if the claimed invention is obvious over Muzny in view of Vogelstein. One 

secondary consideration of particular relevance is long felt need. Specifically, there has been 

a great deal of interest in the scientific community in MDM2 given its potential use as a 

diagnostic and therapeutic agent. This interest is summarized in the cited patent, Vogelstein 

Additionally, Appellant submitted during prosecution and IDS listing four references relating 

to MDM2 (a copy of the 1449 Form previously submitted and made of record is attached 

hereto as Exhibit 3). However, there was absolutely no disclosure or suggestion of the 

genomic organization of MDM2 genomic DNA until the instant application was filed. An 

independent disclosure of the genomic organization of the MDM2 gene was not available 

until July 21 , 2004, more than one year after the filing date of the instant application (Liang et 

al., 2004, Gene 338:217-223, submitted herewith as Exhibit 4). The dearth of knowledge 

regarding the MDM2 gene was actually admitted to by Liang et al. on the first page of his 

article where it was stated "Although the human MDM2 cDNA sequence has been reported, 

the genomic organization of the human gene has not been documented". Further, Liang et al. 

discussed the usefulness of determining the genomic structure and organization of the MDM2 

gene on page 218: 

One of the distinctive properties of MDM2 is the possession of 
an extremely complex expression pattern. Its multiple-sized 
transcripts and proteins have been found in tumour samples and 
cell lines by a number of groups (citations omitted). In our 
previous studies, five alternatively sized transcripts of the 
human MDM2 were found in human ovarian tumour, bladder 
tumour and leukaemic cell samples (citation omitted) Here 
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we present data demonstrating two further MDM2 transcript 
forms with internal sequence deletions in human tumour tissue. 
We Hypothesised that these transcripts are generated by 
alternative splicing. To test this hypothesis and to explore the 
associated mechanisms, we have investigated the genomic 
structure and organization of the human MDM2 gene. . . 

Conclusion 

It is Appellant's position that claims are not obvious over Muzny in view of 
Vogelstein. It is further Appellant's position that the claims are in condition for allowance. 
The Examiner is invited to contact the undersigned at (914) 712-0093 if she has any 
questions. 



Respectfully submitted, 



Date: March 2, 2008 /Cheryl H Agris/ 

Cheryl H. Agris, Reg. No. 34,086 
P.O. Box 806 
Pelham, N.Y. 10803 
(914)712-0093 
Customer No. 25538 
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CLAIMS APPENDIX 



7. An isolated nucleic acid molecule 20-51039 contiguous nucleotides in length consisting of a 
reverse or forward strand of a region of SEQ ID NO:4, wherein said region is selected from the 
group consisting of a 5'-non coding region depicted in nucleotides 51039-41739 of SEQ ID 
NO:4, a 3'-non-coding region depicted in nucleotides 9503-1 of SEQ ID NO:4, a contiguous 
intron-exon region between nucleotides 41738-9502 of SEQ ID NO:4, wherein a sequence 
segment comprising 41738-9502 of SEQ ID NO:4 encodes human mouse double minute 2 
homolog depicted in SEQ ID NO:2, a contiguous exon-intron region between nucleotide 41738- 
9502 of SEQ ID NO:4, wherein a sequence segment comprising 41738-9502 of SEQ ID NO:4 
encodes human mouse double minute 2 homolog depicted in SEQ ID NO:2, an intron depicted in 
nucleotides 36385-40645, 36309-33127, 32994-29616, 29564-25577, 25507-25384, 25287- 
21169, 21006-14110, 13953-13267, and/or 13188-10665, a region comprising a dinucleotide of 
the following group: 41739-41738, 40645-40646, 36309-36310, 36384-36385, 32994-32995, 
33126-33127, 29564-29565, 29615-29616, 25507-25508, 25287-25288, 25383-25384, 25576- 
25577, 21006-21007, 21168-21169, 14109-14110, 13953-13954, 13266-13267, 13188-13189, 
10664-10665 and/or 9504-9503; a transcription binding site selected from the group consisting of 

BINDING SITES huMDM2, location in SEQ ID NO:4 

AP1_C: 36-46,2876-2886; 

AP4_Q5: 7944-7980; 
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AP4Q6: 



7943-59, 8924-8940, 9294-9310; 



CDPCR3HD 01: 



1682-1706, 2193-2217, 9201-9225; 



1040-1058,7803-7821; 



1424-1438, 3917-3931,4178-4192,4787-4801, 6855-6869; 



DELTAEF1 01: 



83-95, 6328-6340; 



2757-2773, 5154-5170, 5823-5839; 



4846-4858, 7017-7029; 



6045-6057, 6073-6085, 6142-6154; 



2489-2501, 3323-3335, 3384-3396, 7393-7405: 



3264-3276, 6870-6882: 



40-52, 5729-5741, 6529-6541, 6874-6886, 7041-7053, 7589-7601; 



349-7361,8188-8200; 



1743-1759, 7995-8011; 



502-518, 1739-1755, 4160-4176, 9402-9418, 9418-9434; 



8184-8200; 



951-963,3588-3600; 



1202-1210, 1447-1455, 4997-4005, 5424-5432; 



NF1Q6: 



1480-1500,8166-8182; 



NFATQ6: 



4190-4208,6009-6027; 



NKX25_01: 741-755, 1648-1662, 1885-1899, 1984-1998, 3609-3623, 4928-4942, 

5060-5074, 5889-5903, 8850-8864, 9190-9204; 



2584-2599, 2970-2984, 4644-4658, 5179-5193, 6482-6496; 



RORA1 0 1 : 220-238, 2638-2656; 

S8_01: 4644-4656, 4842-4854, 4845-4857, 5200-5212, 5371-5383, 5735-5747, 

6482-6494, 6541-6553, 6544-6556, 6772-6784, 7270-7292, 7273-7285; 

SOX5_01: 1355-1371, 1430-1446,3094-3110,3155-3171,4669-4685,4692-4708, 

4789-4805; 

SRY 02: 4164-4180,5665-5681; 

TATA_01: 1261-1277, 2574-2590, 2723-2739, 2733-2749, 2770-2786, 4199-4215, 

4206-4222; 

TATA_C: 5900-5916, 7456-7472, 7702-7718, 7917-7933; and 

XFD2_0 1 : 7702-72 1 8 , 79 1 7-7933 ; . 

a transcription binding site selected from the group consisting of 

BINDING SITES huMDM2, location in SEQ ID NO:4 

AP1C: 12109-12119, 12695-12705,22600-22610,24166-24176,31311-31321,35234- 

35244,39184-39194; 
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AP1Q2: 11952-11962, 12068-12078, 14798-14808, 21748-21758, 22613-22623, 23676- 

23686, 26562-26572, 30046-30056; 



AP1Q4: 
39198; 



12695-12705, 31311-31321, 35234-35244, 36295-36305, 38784-38794, 39188- 



AP4Q6: 31635-31651; 

BRN2 01: 13448-13466, 14764-14782,28094-28112,40027-40045; 

CAAT_01: 11288-11302, 15054-15068; 

CDPCR3HD_01: 11286-11304, 13284-13302,20846-20864,29344-29362; 

CEBPB_01: 29241-29255; 

CREL_01: 36091-36103, 38873-38885; 

DELTAEF1 01: 18083-18095, 20385-20397, 26955-26967; 

FREAC7_01: 11982-11998, 15187-15202, 16523-16539, 16529-16545, 16587-16603, 16604- 

16620, 16676-16642, 16633-16649, 16644-16660, 16650-16666, 16657-16673, 16673-16689, 16762- 
16778, 21332-21348, 25689-25700, 26529-26545, 27767-27783, 29495-29511; 



10916-10928, 15775-15789, 18162-18174, 26088-26100,32518-32530; 
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GATA103: 



28012-28024; 



GATA1 04: 11153-11165, 11630-11642, 13778-13790, 17439-17451, 19300-19312, 21606- 

21618, 22743-22755, 23747-23759, 25806-25818, 26529-26541, 29424-29436, 30455-30467, 32761- 
32778, 33352-33364, 33960-33972, 36101-361 13, 40007-40019; 

GATA1 05: 1 1590-1 1602, 26550-26562, 36737-36749; 

GATA1_06: 18772-18784, 23054-23066, 35568-35580, 37855-37867; 

GATA2_02: 20755-20767, 30830-30842, 34755-34767, 36285-36297, 39143-39155, 39641- 
39653, 40586-40598; 

GATA2_03: 13535-13547, 22711-22723, 23161-23173, 25028-25040, 27237-27249, 36277- 
36289; 

GATA3_02: 11558-11570, 16470-16482, 17225-17237, 19619- 19631, 22156-22168, 22443- 

22455, 24713-24725, 27619-27631, 32716-32728, 34124-34136, 34163-34175, 36832-36844, 38403- 
38415; 

GATA3_03: 10869-10881, 11515-11527, 13845-13857, 17221-17233, 18952-18964,20050- 
20062, 40171-40183; 

GAT AC: 15848-15860, 18899-18911, 23640-23652, 29072-29084, 30881-30893, 33198- 
33210, 37472-37484, 38621-38633; 
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GFI101: 



35469-35481,35492-35504; 



HFH2_01: 15939-15955, 24636-24652, 25866-25882, 32171-32187, 35372-35388, 39457- 
35473; 

HFH3_01: 13340-13356, 19218-19234, 21328-21344, 21336-21352, 21344-21360, 28062- 
28078, 32125-32141; 

HFH8_01: 14133-14149, 22578-22584; 

HNF3BJ3 1 : 1 3 150- 1 3 1 66, 1 6505- 1 652 1 , 25264-25280, 29443-29459, 37654-37670; 

IK2_01: 11547-11559, 17144-17156, 18961-18973, 23883-23895, 27617-27629, 28908- 

28920, 29241-29253, 30752-30764, 34768-34780; 

LYF1 01: 12319-12331, 19191-19203,37226-37238,39430-39442; 

MAXOl: 22974-22986,33339-33351; 
MZF1 01: 26105-26113, 35187-35195; 

NF1_Q6: 12048-12064, 33334-33354; 

NFATQ6: 13295-13313, 14157-14175, 14311-14329, 14414-14432, 18269-18287. 19326- 

19344, 20801-20819, 21177-21195, 22537-22555, 23861-23879, 25392-25410, 25879-25897, 27524- 
27542, 30636-30654, 30718-30736, 31525-31543, 33655-33673, 34726-34744, 34917-34535, 34990- 
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35008, 35979-35997, 36479-36493, 36577-36595, 37154-37172, 40224-40242, 40365-40383; 

NKX25_01: 12041-12055, 12340-12354, 12471-12485, 12742-12756, 12877-12891, 13849- 

13863, 18995-19009, 21440-21454, 21883-21897, 28426-28440, 30964-30978, 32033-32047, 32265- 
32279; 

NKX25_02: 10998-11012, 12711-12725, 14131-14145, 14726-14740, 16024-16038; 

NMYC_0 1 : 1 8753- 1 8765, 1 8754- 1 8766, 23076-23088, 30534-30546, 34400-344 12; 

RORA1_01: 13134-13152,22966-22984,24934-24952,33341-33359,34760-34778; 

S8_01: 11000-11012, 11977-11989, 12048-12060, 12051-12063, 13747-13759, 13923- 

13935, 13926-13938, 14676-14688, 14679-14691, 16026-16038, 16313-16325, 16316-16328, 17515- 

17527, 20756-20768, 20759-20771, 23154-23166, 23157-23169, 25198-25210, 25201-25213, 26651- 

26663, 27508-27520, 27511-27523, 29450-29462, 29478-28490, 29775-29787, 29778-29790, 29813- 

29825, 29816-29828, 31329-31341, 31677-31689, 31680-31692, 31732-31744, 31735-31747, 36137- 

36149, 36140-36152, 36812-36824, 36815-36827, 37413-37425, 38679-38691, 39474-39486, 39477- 
39489; 

SOX5_01: 27397-27413,27572-27588,28100-28116,29230-29246,29439-29455,30690- 
30706, 31595-31611, 33871-33887, 341 13-34129, 34624-34640, 37668-37684, 38582-38598, 39124- 
39140, 40410-40426; 



SRY 02: 20016-20032, 22410-22426, 27329-27345, 29162-29178, 29499-29515, 30646- 

30662, 31503-31519, 35928-35944, 37324-37340; 



TATAOl: 32722-32738, 32729-32745, 32807-32823, 33825-33841, 34120-34136, 35433- 
35449, 36593-36609; 

TAT AC: 11015-11031, 11817-11833, 13635-13651, 14930-14946; 

TCF11_01: 18543-18549,22574-22580,31281-31297,31489-31505,38754-38770; 

USF_01: 23075-23087,32577-32589; 

VMYB_02: 11526-11538, 17384-17396, 18400-18412, 19549-19561, 22188-22200, 40486- 
40508 and 

XFD2_01: 16620-16636. 18153-18169,22102-22118,23141-23157. 
And a transcription binding site selected from the group consisting of 
BINDING SITES 

huMDM2, 1 location in SEQ ID NO:4 

AP1_C: 44584-44594,49069-49079: 

AP1_Q2: 42174-42184, 45217-45227, 48422-48422, 50447-50457; 
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42702-42712, 50806-50816; 

42117-42133, 421 18-42134, 42244-42260, 45432-45448; 45433-45449, 46609- 
46625; 

42310-42328, 44022-44040, 47514-47532, 48900-48918, 48967-48985; 
44866-44880; 

45671-45689, 49219-49237; 
42437-42449, 49797-49809; 
47026-47042, 47292-47308, 47658-47674; 
43482-43494, 48926-48938, 49284-49296; 
47371-47383; 

43054-43066, 43162-43162, 43967-43979, 45464-45476, 45916-45928, 47763- 
47775; 

49319-49331,49459-49471; 
47590-47602; 
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42660-42672, 43475-43487; 
43714-43726, 50948-50960; 
49155-49167, 49844-49856; 

42202-42214, 44810-44822, 48438-48450, 49136-49148, 49337-49349, 49869- 
49881; 

44011-44023, 45256-45268, 45823-45835, 47915-47927, 49201-49213, 49573- 
49585; 

46606-46618,47063-47075; 
47030-47046, 47284-47300, 47288-47304; 
45275-45287; 

44564-44576, 46991-47003, 49567-49579; 
43234-43246, 48726-48738; 

41772-41780, 42290-42298, 42295-42303, 44507-44515, 45105-45113, 45203- 
4521 1, 49948-49956, 50774-50782; 

50209-50229; 
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NFATQ6: 



42061-42079, 44418-44436, 46399-46417, 47974-47992, 49267-49285, 49964- 
49982, 50392-50410; 



NKX25_01: 42394-42408, 43507-43521,46115-46129; 



RORA1_01: 45073-45091, 48718-48736; 



S8_01: 43552-43564, 45214-45226, 47160-47172, 48419-48431, 49295-49307, 50379- 

50391; 



43716-43732, 46351-46367, 47156-47172, 47774-47790, 47868-47884, 
47974-47990, 48915-48931, 50323-50339; 



45588-45604, 47625-47641, 48026-48042, 48659-48675, 49056-49072, 49079- 
49095, 49152-49168; 



TCF11 01: 49115-49131; 



42010-42022, 42279-42291, 44651-44663; and 



42870-42886,42910-42926. 



10. A composition comprising the nucleic acid molecule of claim 7 and a carrier. 
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15. A kit comprising the nucleic acid molecule of claim 7. 

16. The kit according to claim 15, in which the nucleic acid molecule is labeled with a 
detectable substance. 

17. A solid support comprising the nucleic acid molecule of claim 7. 

18. The solid support of claim 17 wherein said support is a microarray. 

20. The solid support of claim 18, which further comprises a nucleic acid molecule encoding 
human mouse double minute 2 homolog, complementary sequence thereof or a portion of 
said nucleic acid molecule containing at least 20 contiguous nucleotides. 

22. A method of identifying variants of SEQ ID NO:4, or its complementary sequence, 
comprising 

isolating genomic DNA from a subject and determining the presence or absence of a variant 
in said genomic DNA using the nucleic acid molecule of claim 7. 

23. A method for detecting the presence or absence of SEQ ID NO:4 or its complementary 
sequence in a sample, said method comprising (a) contacting the sample with the nucleic acid 
molecule of claim 7 and (b) determining whether the nucleic acid molecule binds to said 
nucleic acid sequence in the sample. 
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24. An isolated nucleic acid molecule 20-5000 contiguous nucleotides in length consisting of 
a reverse or forward strand of a contiguous exon-intron region between nucleotides 41738-9502 
of SEQ ID NO:4, or contiguous intron-exon region between nucleotides 41738-9502 of SEQ ID 
NO:4, wherein a sequence segment comprising 41738-9502 of SEQ ID NO:4 encodes human 
mouse double minute 2 homolog depicted in SEQ ID NO:2. 

25. The isolated nucleic acid molecule of claim 24, wherein said nucleic acid molecule is 20- 
5000 contiguous nucleotides in length and comprises nucleotides 41739-41738, 40645-40646, 
36309-36310, 36384-36385, 32994-32995, 33126-33127, 29564-29565, 29615-29616, 25507- 
25508, 25287-25288, 25383-25384, 25576-25577, 21006-21007, 21168-21169, 13953-13954, 
14109-14110, 13188-13189, 13266-13267, 10664-10665 and/or 9504-9503 of SEQ ID NO:4 or 
their reverse strands. 

30. A microarray comprising a plurality of the nucleic acid molecules of claim 7. 

31. The microarray of claim 30 wherein said microarray further comprises a nucleic acid 
molecule encoding human mouse double minute 2 homolog, complementary sequence thereof or 
a portion of said nucleic acid molecule containing at least 20 contiguous nucleotides. 

32. A method for detecting the presence of a nucleic acid sequence of SEQ ID NO:4 or its 
complementary sequence in a sample, said method comprising contacting the sample with the 
nucleic acid molecule of claim 7 and determining whether the nucleic acid molecule binds to 
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said nucleic acid sequence in the sample. 
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Human Chromosome (Chr) !2qt3 is a region ofcimkal interest m 

t vaoet ise j - t - " 1 

if. i A rjumht ! ailed genetic mips of the regt n 

around human Chr 12ql3 arc now available. A second -generation 
YAC coring map of human Chr 12, which extensively covers this 
atea. has also been produced. This has resolved a number of dis- 
N \C contig 

map loin in " s ^ 

region between the polymorphic mierosateilite markers D12S333 
and D12SJ06. A. number of expressed sequences have also been 
localized vith v ig dtp " P » lo region arouna 
!2qI3. 

Jn an attempt to generate YAC resources with low levels of 
, s t < f- ,nn in this gc 

)< 130 YACs from the 
ICI library (Anand et al. 1990). We have used 36 genetic markers 
and U ESTs from five previous Chr 12 maps (.Cuyer and Cann. 
1992; Weissenbach et al. 1992: Schoenmafcers a a!. 1994; Kueher- 
iapatietai 1994; Gyapay et a! W°4 >u data can be integrated 
with the most e < • • 1 r > - ' 1 P 1 1 hi 

Krauser '95). V sve pped the WN1 ene (N« 
at. 199 U al 62-66 cM (based on the 200 CM t erail C in I ip 
off ra . t 1995) ooii iJ9<:i;nh 
gapbetw " ' '> ' > GPDigenc. This is 

a rep; wed in the ! nest Chr 12 cootig map. We have 

1 bservedct n < » \S >! .dGLl at 77 c.U 
m ihe interval between 0/255/2 and D/.7S90 (Kramer ct al. 1995). 
[' he markers i 1W2 ' ! * 

area between I>/2cY.?90 and D/25i95, which has enabled us to 
pi io «T t neralto! i ne ' r o ! - 1 < r T1 * to 
these points. We have isolated YACs for LRPM2MR, which is 
known lobe physualh - nkedtc 01 1 (S is tnd Mykh x>st 1992} 
These may be of value m closing a gap at 76 cM in !hc current map 
immediately centromere: of GADD153/GL1. The YAC UGH7 
containing tM2i,312 mas also he useful m this n^pt 1 * as ss the 

osest known marl >x>ma1 i a nt o - tarl 
YACs contau U IDl na help to ose ih gap 

between RAP18 and PI2S80 at 85 cM. As well as YACs con- 
taining COL2A1 and LALBA, which are included in the latest 
p. we have isoaited ^ \C , lot PM< * ( D63 and PA81, 
which are as yet not accurately positioned on the map. DI2SS9 or 

! ! um ^Cs described her re new t 

di taJ to i rreg m oi Merest at 1 13 cM and 118 cM respectively 

1990) has enabled us to generate oihet novel STSs and eorrespond- 
\ ^> i 

Novel YACs identified with 36 genetic markers from (he 
12ql3 regt - >ress< piences in the region too 

kj own. a corresponding YAC from 
the second-generation YAC cootig map of human Chr 12 is in- 
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eluded in Table 1 for ease of cross-tt fence (Krauter ct I ?95) 
The 47 loci screened al! produced at least one positive YAC and, 
u, most ases <■ ultip e t VC*- t sistoi'o 1*31 loidg, omn. 
rantesentation of ibis library (Anand ct al. 1990). In a number of 
case:;, the same YAC was identified with more than one genetic 
marker, implying that these markers lie in close physical proximity 
, , - it sen - of the YACs m this library is 350 kb). 

D12S339 and the WM 1 onco| t x> iUve signals 

with YACs 26BB4, 28A04. and 32P ■ r ^ on < t the 64 <M 
connggae f ' ' d 68 cM both 

identified YAC 30AD1 1. D12S17 and D;2S96 both identified the 
YACs 20BB4 and 20BF12. Th pla DUST? at the 71 cM locus 
tn the. middle of the KRT gene family and ce»tromeric of 
D12S398, DI2S359, D12SJ9, D12S325, HOXC5, and D12SJ03 
tcspeciively. G ADD 153 and GUI at 76 cM idewtiry the YAC 
6EC. i, co in h< lose physical linkage of thest wo genes 
This locos is immediwely telomeric of a gap at 76 cM in the 
current second-generation Y AC contig map, and end seqw • * 1 - 
of 26EG10 mav "assist in contig closure. The markers D12S305 and 
y ihe \ \C iOEDl concot 
nd-i i. » t ' - itiencing of lOEDt and 
ng of the YAC library idci >d 1' M r 't VA< 
comaiti '•-:< ronfiro ig t arfcei ord i D12S90 D 2S 05 
DI2SW4. D12SS3 and ) 1 - YAC 34CA7, 

confttmrng their physical proximity at 77 cM also (Krauter et al. 
1995) . l 

YACs for which sequencing of human DNA insert termiBi has 
been per ed ho vn in 1. Follow confirm o > 

localization to Chr 12 with the C one' M - n ! ' i . < tional 
d f I ' ' s ' 

covcvaE'e at D12S96/17 (71 cM, 20BP12), D!2S90/30$/m (77 
• i iOEDl, 55BE5), and DI2S43 around the 86 cM region 
(34HH10). A number of YACs proved to 1 is 
analysis; the right-hand end of 40AAI2 (a YAC identified usi v 
primers 1D.M man < fhe MDM2 

1 FISH analysis (d 

shown) Tit V'n 2 C 40< I metic b> FISH 

analysis and was used to confirm its localization to the 12ql4 
reeicn (Pia. 2; Heiuhwav et al. 1994). The other YAC termini 
n Table 1 id" tpt < ir 12 by PCR analysis oi rmasic 
ceil hybrids. 

in" the last four years, six separate Chr 12 maps have been 
presented which include markers in and around the 12qt3 region. 

IH BPH < lab mapping group comprehensive ge- 
netic, linkage map of the human genome (Guyer and Cann 1992), 
tjunct with • V( oi i Sei sociates (5994}> 

provided an approximate marker order cen-COL2Al-El.Af- 

IS-D12S25-DI '" c > 12 
D!2S!?-D!2S6-D!2S2S-D}2S22~D!2S28~DI2$!9-Di2$43- 
D!2S$-D12S64-D!2S'7-k\. with the latter probably located dis- 
tally in 12qli In comph 9 
and colleagues described another set of markers across the same 
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.Table 1- YACs id 
"dstacied by (note 
I' reference- YACs is 



;ii makers and EST:;. 



:hidtai (Riley a a;. I 

5 !t>i\ c rosen, 

!. F mil inn H ; 

n«s. P< 1, c ai descntx V.C 

screnwd to owtttna Ux»iimic« <" bstnan Chrii 



■ > ic ted 1 1 



Tabfe 1. Ctuvias&it 



Tl ' -T £ «Ctft TTA9W 

TA T'C 6AS OiO «T H A TTC CAG WT CAt CAC AA? CAS CAS 
ChmM *M TA6 AAA «AT CAS AAA AC* CST «K TAT T W 6GA ATA AT* TAT rt » CM 

12 tsjfl 6W. AAA as, TAT«TT ATA ACT sag AAS AH ACC Wf T.J- 







i itot. 
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Markers 


Tsk CQ 
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CEPM VACs 




5S.5 




20B84; 20MM3 






5S.0 




34BH10 . 




0/2KS 


53.0 




. ID: J71DS :.•<.*•: 




W2S» 


53.0 


!-S 


IS 92; SJGDd 






55.0 


2.0 


SFfiO issegi 9dh h 12 










34DD7; 37FC5; 3SC<iS 




oms3 


50.0 


K5 
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53,5 


20 


SSA12: I2HC4; 12(1:3: 1SCH4, 


6»l, ?6'f:S 








2IGB5. .3" EH 10. 




mm? 


520 


2.0 


!HG9; iOiGS, !0)Grj, 23BAIO, 


75<>b7 








37AE2. 37FA3 




D/zm 


53.0 


13 


400S5: PEGS: ISBKS; 23!Df; 










34HA8; 361 €2 






53.0 


2,4 


wmi; MBfU 




Di2Si«3 


53.0 




7DF1 


-g0f7 


O12S104 


zl'i 


GO 


)0£.»J; 30AA4; SJEH4 




Dlii'lM 




! ■ 


33FHS; >5i)F4. 391H2 


763a8 




"n 




SHBIJ; 18DA9 




Dl Sl.il 


54 5 


15 




- 


072675? 






40A« 










»AOU 


717gS 


0/257705 


St. o 


i,S 


lOEDli 20HH9; 3S8C9 


9£i7f7. 




5 -JO 






790f7 


Di2S-!i! 


Si.O 






75)34 








Saiil2. 8EC4-. 24BF5: 3?SB! 


nz*n 


1 i2S , 


54 S 


1.S 


30F&5 


S«..:(7. 




565 


!.S 


HOGU; |«r>!> 24FH7 






51.5 


!.S 


OGM. iiH», 40GB !2 




onsuj 


53.0 


IS 


SHGS; i2GG-; S7GH4; 1 9DD7 . 


352^6 








341353* 










34CA7' 35!i>6 


75547 ' 


onshs 


5:5 


15 








55 0 


i.5 


. s - - ; 


925bJ2 




57.5 


).S 


27BK? 
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omits 






OFTI ;<> . Ri • 
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33!iH!2, 371M1 




OUS.W 


53.0 




HSOT 










16S31: 24Ci;n. 77 EES; 2s ; Or3; 


gi«n 








3JCCG-, 3'SHB7; UK r.i 




omiss 






39C38 


C.-G'a-S 


DI2SHI 


%i 




26BG9; 40CD1 ! 
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7;7gS 
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9CBII: HHHfe 23CCH); 37HC50 
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Tin 









; . i! 
MSG 10 

26EGI; 2SEGI0 
26BB4; 2SAIM; 3JBC2 

fin>ii.40AAi2;40C35: 2: 

130G9 

26A88; 27GEU: 33AG6 
4AE4 



A-X, A^C TAC ATA :T-. «k ATA .V!A AT? AAA AT ' C'C UAA AtA «TG AC-:. .V.v 
TTT CAA m AAS TTA CAT ATC ACT ATC TTT TM Ml AAfi MT SCA !«A TI8 
8S CAS WAT- 3' 



YACdone Left end » 



- a:a -TT 6: a TTT ItX TO'"- K : 1 AC 'AAA Aa: ATI CAA C^T TT 
1 TUAACTTT f Ay AAA T'A AT' AAA TCT CAC t TC AAA TTT 6TS TK 

T 6TA TM W «A 6TA MS ATT 6*5 AIT 6 



TrASAOCAATA 



34fBU8 



region from Dl >S87 to D! 25 106. 3 regio i . i that «age tbouahl.to 
be of around 35 cM. The reported order of markers was cen- 
0 1 2SS7-D ! 2S85-D1 2S96-D 1 2S 1 03 -D 1 2 S 90-O1 2S 2 04- 
I t I. ft was not 

clear at that time how these two different sets of markers over- 

pp 1 u» 1 3 v. in the report o 

ffiternational Workshop on Human Chromosome 12 Mapping 

[ mm Oenethoo Tian get - iF J 
i94) tit n spanded 45 cM 

region flanked bv D/25S7 3rni D12S106. Hiis map was integrated 
with that of Weissenbach and ecu »it e; ( i 99 1 1 M< >st recently, a 
secOBd-gcnmtion YAC contig map of human Chr 12 Jjas been 
produced (Kramer ei aJ. 5995). This represents an almost complete 
nap< I * K > resolves mat 

T t ri a : 

the contig map places D12S8?, previously viewed as a 12qll 

t: 5 c ; on I2pl 

The marker 1>12SJH is now positioned at 55 c.M just below the 
centromere on the q arm. However, this latest map still contains 
five gaps in the region we have been investigating sad does not 
is ludt nany fi -'eneraUoa genetic markers, some of which can 
now be placed on the map. 

While the second-generation YAC contig map of the I2ql3 
n .i n n h , i< t d as definiiivi i .■>--' i ,r * , 
th reir v t r previous gens ic maps a > iberi in this 

paper allows as to draw some useful additional conclusions. Work- 
i Muere to qter. tl T^T (ATP5B) mapping in 

the 12c 12 region f Kucheilapati et 3} 1994) that :t ttneiuded 
oiiti" map n> . sumabiy i > ■? en the cemro 
mere (54 cM) and the LALBA/O/25/20/COL?.A!A'DR/O/27SS5 
locus, which is now well resolved on the physical map but was not 
clearly discriminated on the genetic map. It is iherefore possible 
gap n '! e (.arren' VAC cont g at 
around 50 cM. The marker DI2S3 i9 which was included between 
DJ2S85 anO I I I - n the 1993'^4 

Genethon map, is not included in the 1995 YAC contig map. We 
have now sno- i t0 

WNTL tt seems probable that these two markers map in the sec- 
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owl contig gap around o4 cM on proximal 12q. The 1994 Chro- 
! < p snap Sinks WNTi with Ik2S!3I, D12S6, 
D12S!?, and £7257/ (Kacherlapati ct a!. 5994), Our data contra- 
- - • , a t •raiil tight physical tink- 
le between OiZSI' ~ \ er now mapping more 
17 cM in the middle of the keratin gene cluster. Nev- 

t h v 'hil to screen YAC libraries with 

D/256. 0/2S7/ (0i25/i/ is reported herein), because it is con- 
ceivable thai these may also lie in the 64 cM \2q YAC conttg gap. 
!-t i. ta • 5cho ro ko i kc also valuab 

regard. They localized a series of fvrsi-generatfcra Chr 12 markers 
to the region between COL2A1 and £>/2S/7. Their approximate 
louis o-der was een-H \i D!2i 

> i n?. >!2$l?~lcl They a! so mapped D/ Sd 

distal to 0/25/7. which by our data would .also place it distal to 

r . ' ' 1 > . I ( 

However. ELA1, D12S29. D12SIS, D12S2S, W2S14. D12S4. 
D12S18. and D12SI& would be potential markers to use to identify 
YACs in an attempt to fill the second YAC comig gap at 64 cM. 

Our physical linkage of 0/2577 and D12S96. plus the 1995 
(o altKt! U t 71 cM <Ki ut r « ^ 1995) would 

suggest ibv comparison of the 1992 NiH/CEPH map with the data 
Z ; ( '252*. and 0/2572 map 
between D12S96 said 7W2S/9. This can now he easily confirmed 
because the 'V A- coal is c mpSen stween these points \gam, 
the 1992 NfB/CEPH map and Schoenmakers and colleagues 
(1994) have previously placed the markers D12S43, D12S8, 
> \ '0 The 1^94 workshop report 
(Kuchcrlapati ct al. 1994) linked D12S8. DI2S43. and LYZ with 
012 i Inch i o i» i*> > 3 « thi V AC ontig at 8? cM. 

i > > *C eonfig at 

85 cM. 0/255, D12S4X and LYZ are useful markers to screen 
YAC libraries in an attempt to bridge the 85 cM contig gap be- 

t , < V i ' 1 > 
port (Kacherlapati et al. 1994) links the fFN-? and RAPSB loci 
with DI2SS6. Again, Dl 1SS6 may hi; use u 
(hk teg o» Moiw.a rr.au ai or! 1995 napp 
t • MDM2 < i close , e io IPN-y, bu MDM2 - - i 
not included on the 1995 cottttg. It ss. therefore, also possible thai 
MDM2 J i ' wiecootij p.! pport f this -he MDM2 
YACs differ from our DS2S313 YAC immediately proximal to 
IFNy. This suggests that MDM2 may be distal to IFN-y. While two 
of ihe Y-V. described hen no ' 1 \ chimeric 

(6FD1 1 and 40AA12). the YACs 40CB5 (Fig. 2) and 26DC3 
H v i> et al. 1994) have been localized by FISH analysis 10 
12q 14 %-q\ 5 r it:g gap. This 

cytogenetic localization is in agreement with the reported local- 
ization of IFN > i J ' f tlie mapptn! 1 <■ 
Slightly dit "J s D12S213 >«1 D12S I to i -^S 4 q! 1 . 
Fejzo and a 1995). h ntcrescmg that the previ »osly 
noted co-ai-npl.fi canon of the GU. CDK4 and MDM2 genes in 
human sarcoma (Khatibet 1 199 > i stent with th 
genetic map of this region. The YACs for D!2S3$Q and 0I2S326 
i 1 { a fifth gap. \> , M 
in the current con tie reap. 

D12S64 may be sor e <a j> distal to the &i2S8 , 

) locus at* !2ql4-<3 - a 1 <> N 

t >S cM fthn.h we 

have taken as the boundary of o cn Similarly, 

D/2.yi«, for which three YACs are described herein, now maps at 
113 cM, bevo« i s end of the 

region, D/2.C.S9 and DJ2S345. both map adjacent to the centro- 
mere on the short arm of Chr 12 (Kucherlapati et al 1994; Krauter 
et al. 1995>. Our D12S87 YACs also map on the p arm. as do our 
most dastai YACs for D/7.5.13.?(!2p-| 1). 

Of the ESTs that have been used h tudyt generate YAC 
clones L \L-> KCO 2A\ o^Of>i 1 1 

honed on the 1995 YAi t around oft. 6 



cM respective! 1 IGF1 .* e, for which is > ufie . 
number of YACs, maps distal to i 2SW< a ai und US cM 
; YACs a 1 e included in t'abi ah t. r cir 
rf t ^ 1 ! r > * > i'ACs t i 
not yet positioned 011 the 1995 YAC contig map, the WNT1 clones 
promise to be »f value in thai they « . D -'J9 in a current 
corAtiggapat 64 - I I ones may map 

1 At t cM. At present, the pr 

, nsofPMCA.CDt I he. p calmapremai 

to be con urn id YACs lor the LRP' 

his been physically 

mapped 300 kb from GLI (Fonts and Myklebost 1992). Tiiose may 
be use n ng a third YAC i! ?ap in this regiott between 
0/252/2 and GADD153 at 76 cM, as indeed may the DI2S3I2 
, J . DD 'GLI YAt e< ril d - reu » t , ibility of a 
wide range of YAC reagents from the I2cen-12ql5 region will 
v i 1 ' ig exei 1 this area of the 

genome and should allow us to resolve some of the. 0 



015SM3 ~ 



i i lighung gaps in th 

mi ^ \< ontig 1 ' t- > At is tor some of 

which r'ACs are de 1 ' ' m. The ICI YAt ra y {Auatid et a! 

I t I P M t' 

nettc marl r 1 f ' ned 100 ng poolec 

r a ( 1( i ' bpri 1 1 ,i r(Prom 

0 ' i 1 Df isc (1 roe^a). and 1.5 
v > ii specific ea; shown m Table 1 where higK MgCl 
• ccssary. All PCRs were csmed oat *nb a Techne 

PH( b i cycle! iena arii K 1 n/95 r p rif - 

ON A \Q mi"S 1 1 lfil <oi hv 38 cycti s of 30 

, . 0 »i'h lfioa! elongation step of 10 

ffiinmx. 
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Fig. 2. Fluorescent in sira hvbndiittsiioti a 
of she MDM2 VAC 40CR5 confirming 
localization of this gene to 120,14- 15. 



questions concerning the frequent translocations encountered in 
the reg « ty of n la 
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The Gene Coding for tnterferon-y Is Linked to the D12S335 
and D12S313 Microsatellites and to the MDM2 Gene 

j. F, Bureau,* F. Bihl,* M. Brahic,*' 1 and D. Le Pasoer! 

*Unm o'es Virus tents, Imiittft Pasteur, URA CMS J 28 rwdvDr. Roux ?S?24 Pans Ce<kxJS. Frmce; md 
tCmm 1 c -o^me Human 27 rue Mme Dodu, 750W Pans, France 

Racew.i March 15, 1995; scented Apfil 27, 1935 



pendent diabetes meliitus and a control group. For fu- 

Interferon^ « a cytokine with multiple effects. It ture studies using the candidate gene appruacbyt 



interferes with the replication of several viruses and 
plays a key role in the t egldation of imnmne responses. 
Therefore, the gene coding for iaterferon-y could be 
implicated in the susceptibility of humans to several 
diseases. We have localized this gene close to the 
D18S33S andD12S3I3 mieroaatelliteB on both the phys- 
ical and the genetic maps of the human genome. We 



•odd be extremely useful to locate precisely the IFNG 
gene in particular with respect to polymorphic microsa- 
tellites markers. 

With this goal in mind, we screened the YAC library 
from CEPH (3) by PGR using two primers (forward 
5 '-GCTGTTATAATTAT AGCTTGT-3 ' and reverse 5'- 
AGGGTATTATTATACGAGCT-3') derived from those 



also physically mapped this gene , close *J*»M^ Ascribed bv Ruiz-Linares (17) for the IFNG microsat- 
locus on chromosome band 1 2«jl5. FmMy, we describe - rf ^ ^ „„ a v;* from Amersham. 

the organisation of the Ifg, Myf-6, Mdml, and Mdm2 ^* M ^^^J^Jf^^^^^ 
loci on mouse chromosome 10, in a region syntonic to 
human chromosome baud 12«J 5. o isss Aoutamte p«»s, fat. 



Interferon-? (IFNG) was discovered because of its 
antiviral property (12), although it is now studied 
chiefly because of its centra! rote in the regulation of 
immune responses. This cytokine is secreted by CD4* 

T cells committed to the Thl path way, by CDT T cells, D12S835 and DI2S313 
and by activated macrophages. Because of these mulii- wa g tested for the presence of the IFNG uucrosatelhte. 
pie functions, v t gen< cod • g for I PNG is a good candi- Eleven clones contained the IFNG mto'as&tellite. Ten 



After denaturation at 94°C for 2 mio, 200 ng of DNA 
in 25 u\ was submitted to 40 cycles of amplification 
(94*C, 49 s; 50*C, 40 s; 72°C, 15 s>. Using this procedure, 
we isolated clone 825G7 of the YAC library. We found 
that, clone 825G7 belonged to a coatig of 25 other YAC 
clones, according to inter Alu PGR patterns described 
in the CEPK-Genethon WWW server (6). All clones 
this contig contained either both microsatellites 
one of the two. Each clone 



date gene for control of susceptibility to vanous infec- 
tious as well as immune-mediated diseases of humans 
and other mammals. The persistent infection of the 
mouse central nervous system by Theiler's virus is a 
case in point. The ifg gene is a good candidate for con- 
trol of the persistence of the infection for two 



(i) Persistence is controlled by a gene that was mapped D12S313 microsatellites, 



also contained both the D12S335 and D12S313 a 
satellites (YAC clones 745A10, 743E2, 7S1A4, 809H4, 
823D1, 870H3, 924E4, 926A6, and 983H8), and one, 
YAC clone 763F1, contained only the D12S313 micro- 
satellite. Therefore, these results demonstrate that the 
IFNG gene is physically linked to the B12S33S and 



to the teiomeric region of chromosome 10, close to the 
Ifg locus (6). {»} Resistant 129Sv mice whose gene cod- 
ing for the IFNG receptor has been inactivated become 
susceptible (8). 

A precise localization of the human IFNG gene was 
not available until the present, work. This gene had 
been localized to band 12q24 using FISH and by screen- 
ing a panel of somatic hybrid cell lines (14, 19). Re- 
cently, Ruiz-Linares (17) described a microsateuite in 



To confirm the position of the IFNG locus on the genetic 
map, we analyzed the segregation of alleles of five micro- 
satellite markers in the eight families that have been 
used to construct the Genethon map (10). Besides mark- 
ers D12S336 and D12S3I3, we used two more microsatel- 
lites located on either side of the D12S335-D12S313 
region. One was taken from a group of six congrega- 
ting markers (D12SI04, D12S305. D12S355, DI2S334, 
D12SS3, and D12S329) centromeric to the D12&335 



the first intron of this gene, and Awata ct aL t3>re- ^ ^ other ^ & 0 f three aggregating 

ported a difference in the alleh ustnbutic t is ^ n , [t] D12 SS0, and D12S92) teiomeric to 

marker between a group of patients with msuhn-de- && m2 S313 marker. For each family, a pair of markers 

^^cvtwria****** b««M««*d.ft«:«(«*«l ™ ^ ^cording to its pol^orplusm within the 

31 67 B Ti , t family. The parental meiosts were not mformsfeve for 



:a28, lOS-m USSS) 

C»5»Tight © 1999 by Aiaiicraic Pttts, Ejk- 
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FIG, 1. Segregation of the alleles of IFNG assd fo«r 
gooot^es a< t} n r « " re obtained from the Genetbon database. D«wh«s wwrespeiri 
avaiJahte Stippled arefis indicate a TScDiiibmatittB 

" a S!ie of the amplified ONA. from the smaller to U» larger fragment 



two families. The pedigrees of the other six were analyzed 
for the markers described above: IFNG, D12S335, 
D12S313, and the two Banking microsateffifces. The re- 
sults are presented in Fig, I for two families and summa- 
rised in Table J No recombination was observed between 
the D12S313 and the IFNG markers among 137 informa- 
tive masses. Only one recombination was observed be- 
tween the M2S335 and the IFNG markers among 135 



informative meioses. Analysis revealed that the flanking 
markers were indeed less linked to the IFNG locus than 
the D12S335 and D12S313 markers. 

We next showed, bv physical and genetic mapping, 
that the DPNG locus is linked to the D12S313 and 
D12S335 loci. Indeed, YAC clone 926A6 was recognised 
by these three markers. This YAC clone is chimeric 
and contains regions of both chromosome 12 (band 
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TABLE I 
Genetic Localisation of WHO 





No. crossing over 
with IFNG 


Ho. of informative 


D12S329/M2S8S 


S<3.9cM> 


129 


012SS35 


1(0.8 cM) 




D1SS313 


O<9.0 cM; 


137 




6 (4.4 cM) 


13? 



12ql5) and chromosome 5, as shown by FISH (6, 9). 
Therefore, our results localize the IFNG locus to band 
12ql5, which is different from the previously published 
localization (band I2q24) (19). The region of human 
chromosome 12 to which we localize the IFNG gone is 
syntonic to a region of the mouse genome where we 
and others have previously localized the mouse Ifg gene 
{2, 5, 18). In our work, the pactions of loci Mdml, Ifg, 
and MyfS and of three mierosatellites DlOMitlO, 
DlOMitU, and BlQMit.164 from the Whitehead Insti- 
tute were determined using the progeny of a Fl (BIO S 
X S-JL/J) X B10.S backcross (Fig. 2} (5). This backcross 
was typed for more than 100 loci that were distributed 
among all of the autosomal chromosomes. The results 
showed that the Mdml, Ifg, and My/6 loci were linked 
to each other in the telomeric region of chromosome 
10. Other authors have also reported that the Mdml, 
Mdm2, Mdm3, and Ifg loci are closely linked (2, 18). 
Interestingly, the MDM2 locus has been localized by 
FISH to human chromosome 12q.l4.3 -ql5 (11). Thus, 
tht *nal> svfi i \U me region of the mouse genome 
agrees with our localisation of the IFNG locus to hu- 
man chromosome 12<;15. 

We confirmed the physical linkage of the MDM2 and 
IFNG loci by screening, with the MDM2 PGR (11), the 
contig of YAC clones recognized by at least one of the 
D.12S335, D12S313, and IFNG mierosatellites. The 
MDM2 marker recognized three. YAC clones (75IA4, 



879B3, and 988HS) that were also recognized by the 
i and IFNG nucrosateOitWi Thas, 

the MDM2 and IFNG loci are physically linked. The 
MDM2 gene, which codes for a p53-like protein, is lo- 
cated in" a region associated with several cancers (15, 
16), New polymorphic markers for this region are of 
particular interest since the only one available so far 
is a NlalV polymorphic restriction site (11). 

The MyfS locus is located in the region of mouse chro- 
mosome 10 syntonic to human chromosome band I2ql5 
(43. Therefore, we decided to localize the MYF-6 locus 
physically. We screened the CEPH YAC library using 
MYF-e-specifie primers {forward 5 '-AGACCTTCTCC- 
ACGCAGC AG-3 ' and reverse 5 -GCGAAATCTGTTG- 
TQCAGCT-3 ') under PCR conditions identical to those 
described above for the MDM2 marker. Two clones, 
92 ICS and 982A6, were isolated. We found that both 
belonged to a contig of nine other YAC clones, according 
to i«ter-A f u PCS patterns described in the CEPH- 
GSnethon WWW server (6). Three of them, clones 
940B8, 937DS. and 949H4, contained the MYF-6 
marker. According to the CEPH-Genethon server, 
clones 940B8 and 937D9 contained the D12S106 micro- 
satellite, a marker 15 cM from the D12S313 microsatel- 
lite, toward the telomere {10>. Figure 3 shows the posi- 
tion of these various markers on human chromosome 
12q,15 and mouse chromosome 10. The organization of 
the syntonic regions is very similar. IFNG/D12S313 
and D12S106 are 15 cM apart, whereas IfgMdml and 
Myf8 are 5 cM apart. However, since 1 cM is, on aver- 
age, equivalent to 1.7 Mb in the mouse and 1.0 Mb in 
human, the physical distances between the markers in 
mouse and in human could be of the same order. 

In conclusion, we have shown that the IFNG and the 
MOM 2 genes are close to each other and to the 
D12S335 and DI2S313 mierosatellites. These markers 
are most likely localized to chromosome bands 
12ql4.3 ~ql5, in a region syntonic to the telomeric part 
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FIG. 3. Segregation of the alleles of Ifg, Mdm I, MyfS, and some raarlcere published by the Whitehead Institute ( < : m 78 PI (SJLfi X 
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Tslomars COfttromaws 
FIG. 3. Genetic maps of the syntenfc regions of human chromo- 
some 12ql5 and mouse chromosome 10. The MBMSand MYF6 genes 
were localize. • ppin j only The other human genes 

and markers were toealraed by both physics! and genetic mapping. 



of mouse chromosome 10. This information will be ex- 
tremely useful in human genetic studies in which the 
IFNG and MDM2 genes will be candidate genes. 
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Abstract 

The MBM2 pmto-onco^c « whic ! nds tothcpS3 tumour snpp'essoc has bet i 

in a range of toman turnouts. -Although the human M>*» cDNA sequence lias teen repotted, the genomic organisation of the human gene has 
[ , i u j ted r < v e< pts in human tunwnrs and 

sug , stu j K -i likictH i i mi Ki 

asenhe -hese variant transcript farms to alternative: splicing, mud to explore associated 
. i onof the human genomic sequen " i ^proximately 

35 kfe and is divided into i 2 exoos. Exon sizes range from 50 to Sjliol hp and intton sizes vary from 121 to - 7<X» bp. The positions of 
inuon-enon boundaries ate compared with the deletion junctions of the muitipfc-sized transcripts and discussed in relation to alternative 
;p ie mechanism. 

© 2004 Elsevier B.V. All rights reserved. 

Keymnds- pii, genomic mapping; lens nrage PCR 



1. Introduction 

The mdm2 proto-oncogene was initially identified as an 
amplified gene front a mouse double minute chromosome 
present in a spontaneously transformed Baib/C 3T3 cell 
line, 3T3DM (Haines et a!., 5994; Sigaias et a!,. 1996; 
Steinman et ai,, 2004). 'The causa! role of this gene in 
tumorigenesis was originally established by transfcction 
studies using genomic DNA sequences. In these studies, 



AU'-t-ujiir.ns MOM! ous doubt nmute : :«ae; RT-PCR. reverw 
transcription polymer a: ti i plero tai\ 'N 1 *!> 

Kiktuv bp, base pair; rii, n < i meth > eiM <« 

" * Database accession numbers: AFW4014-AFW4033 and Af20!310- 
Af201J7l, 
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experimental overexpression of mdml resulted in the 
immortalisation of primary rat embryo fibroblasts and 
induced a fully transformed phenotype in the cells when 
ootraasfecfttJ with an activated rus gene (Firtlay, 1993). The 
human homologue of the MDM2 gene has been found to be 
amplified ra over 30% of human sarcomas (Olmer st el., 
1992; Ussch et al„ 1993), which consequently results in 
high levels of the MDH2 gene product. In addition, MDM2 
overexpressiotj can also occur through enhanced transcrip- 
tion and tomslstioi * 1 no- et a! 1993; Landers et 
al., 1997; Moraand ei ah, 1998). 

The human MDM2 gene has been localised to chromo- 
some 12ql3~14. Although the human MOW cDHA se- 
quence has been previously reported (Glitter et a!., 1992), 
little is known about its g< - 1 i! on The MDM2 

protein is composed of 491 amino acids and contains a p53 
binding domain (codons 19- 102), a putative nuclear local- 
isation signal {codons 181 - 185), an acidic domain (codons 
^-aT^aceniral a ne -finger motif {codons 305-332) and 
a ring-finger motif towards the C-ierminai end of the protein 
(codons 438-478) (Boddy et al., 1994). 
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MDM2 appears to be a plutipotenliai oncoprotein, exert- 
ing its transforming properties through several alternative 
mechanisms, of which Are most extensively studied has 
been the negative regulation of p53 function. MDM2 blocks 
p53 tenscripiiorial function by tending to p53 (Montana et 
a!., 1992; GKrter et a)„ 1993). The binding ofMDVK to p53 
aiso results in the rapid degradation of p53 (Haupt et al.. 
1997; JCubbulat et al., 1997). In addition, MDM2 has been 
reported to have p53dndependent tumorigenfe properties. 
This includes the ability to internet with ami inactivate the 
pRb tumour sup 'iv r (Xiao 5 5, !Wi and to 
bind to and activate the E2F1 transcription factor {Martin et 
ai., 1995). Furthermore, two independent transgenic studies 
have shown MDM2 to have ttjmourigetiic properties in p53 
null mice (Luttdgren et ai., 1997; Janes et si t 1 998). 

One of the distinctive properties of MDM2 is the pos- 
session of an extremely complex expression pattern, lis 
multiple-sized transcripts and proteins have been found in 
tumour samples and celt lines by a number of groups 
(Haines ci at. 1994; Sigalas et al., 1996; Battel et ai., 
2002). In our previous studies, five alternatively sized 
transcripts of the human MOM2 were found in human 
ovarian tumour, bladder tumour and leukaemic cell samples 
(Sigalas et al, 1996). The expression of the alternatively 
Sized forms was found to be more frequent in tumours of 
advanced stage and high histological grade, and they also 
retained their ability to transform N1H3T3 ceils. Here, we 
present data demonstrating two further MDM2 transcript 
forms with interna! sequence deletions in human tutnouj 
tissue. We hypothesised that these transcripts are generated 
by alternative splicing. To test this hypothesis and to explore 
the associated mechanisms, we have investigated the ge- 
nomic structure an lorj anisati ,n <~<f the human MDM2 gene. 
This gene is ~ 33 kb in length and comprises at feast 12 
exons. Toe sizes of exons vary from 50 to Srllfil bp, and 
introns range in size from 121 to ~ 7000 bp. The position 
of intron-exon boundaries is compared with the sequences 
of the MOM2 variant transcripts and discussed in mlalton to 
alternative splicing mechanisms. 



2. materials and methods 

2.L Nested RT-PCR 

Total RNA was extracted from human bladder tumour and 
normal bladder tissues. Nested RT-PCR was earned out as 
previously described (Sigalas et al , 1996) 

2.2. Genomic DNA extraction 

Genomic. DNA was prepared from frozen normal hu- 
man placental tissue by digestion with proteinase K and 
phenol -chloroform extraction. The DNA was precipitated 
with a half volume of 7.5M atrarjoatum acetate and one 
volume of isopropanol, washed in 70% ethanol and 
resuspended in 1 x TE buffer (10 taM Tris, 0.1 raM 
EDTA pH 7.5). 

2.3. Long-range PCR 

Primers (Table 1) were designed from the published 
MDM2 cDNA sequence (Oiincr et a!., 1992). Each primer 
pair was designed to cross the deletion junctions of 
multiple-sized transcripts (Sigalas et a!., 1996) or accord- 
ing to the predicted exon/intron boundaries by referring to 
the mouse mdm2 gene structure (Jones et ai, 1996; 
Monies de Oca Stma et al., 5996). A long-range PCR 
protocol was earned out, with she above human genomic 
DNA as a template, using an XL PCR Kit (Perkin Elmer, 
Part No. N808-192). For comparison, PCR was also 
carried out on norma! human placental cDNA. The reac- 
tion contained 1 x reaction buffer, 0.8 inM dNTP, 1.1 mM 
MgQAck and 4 units of tTth DNA polymerase, 40 pmol 
of each primer and 100 ng of genomic DNA or 20 
pj cDNA in a total volume of 100 pi. The Song-range 
PCR was performed using a thermal cycler (Perkin-Ehner 
Mode! 480) as follows; 94 X for 2 min; cycles 1-16 at 
94 *C for 311 s, 58-62 6 C for 10 min; cycles 17 -28 at 94 
*C for 30 s, 58-62 "C for 10 min and 15 s of increment 



Table ! 



Prime: 'Torwsfc primer "~™ eDNA position Reverse psinw Position 31 cON A. AjsneaJiag 

(s' ~ r) <»ty (y-.y> <Wf '^pg 3 ^ rC) 

Frii ccEgtgiggocctgtgijte 30- « tgttccgaggciggsatesgig m-m 60 

Fri2 gtgcaataccaacatgtctg H4~3 ' caaeajsctitasawttaBsagC 435-410 60 

Prtt trrtatcHggecagtauflHty 474-497 stttcsgctgatt^ctaeiaci; 654-632 5? 

Pri5 . ' i ' 614-636 «gs!£adecs3£et«a3tg 756-695 60 

W6 <✓ < opt m-m tgtc«actaat«=te!atte slS-793 60 

PH7 " M " h &»g£gttectgbgKttcac 969-949 60 

Prig tpm*x i!>w m~9U casattetacactosstgstcrg 1059- 1036 57 

Pri9 uiw.rss.t*.t,*c 980-1002 a^tasgg3aat««ggascac J211-HSS . 57 

WW £ a u te&tw IJ68-H9! c*«*r I277~i254 60 



' Sequence of cDNA ciem* (Olracr « al- !»t) 
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per cycle, with a final extension of 72 °C for 50 min after 
the lass cycle. 

PCR products were separated by electrophoresis on a 1% 
low melting temperature agarose gel (NuSieve GTG, Flow- 
gen) and visualised by cihidiuttt bromide staining wiili UV 
transillumination. DNA band? amplified from genomic 
DNA were excised and pun fled with a QIAquick Gel 
Extraction kit (Qia < ») * '■' wmg the procedure recom- 
mended by the company. 

2.4, Clowns. PCR products 

Purified PCR products were subcloned directly into the 
pGF-M-T Easy vector (Prornega) following the protocol 
recommended by the company Ligation products were 
transformed into Escherichia cols JM109 and clones con- 
taining Ike desired inserts were identified by PCR screening. 
Plasmids were prepared by using the Wizard Pius SV 
Miniprep system (Promega). 

2.5. Sequencing 

Sequencing was carried out manually by using PCR 
product directly as a template or automatically by using 
piasmid PCR product clone as a template. Manual sequenc- 
ing was performed using the Sequenase Version 2.0 DNA 
sequencing system (Aroersham, Product No, 70770). The 
automated sequencing was carried out in the central core 
facility at the University of Newcastle upon Tyne Medical 
Faculty. The sequences were aligned with the published 
MDM2 cDNA sequence, using the DNASTAR sequence 
analysis software package. 



3. Results 

3.1. The detection of multiple-sized MDM2 transcripts m 
human tumours 

In our previous studies, we have found five alternative- 
sized MDM2 transcripts (MDM2-a, -b, -c, -d and •<?) 
(Sigalas et ai„ 6996). Our present investigation of the 
MDM2 transcriptional pattern in human bladder tumour 
samples, but not in normal bladder tissue (data not shown), 
using RT-PCR has revealed two further transcripts, sized 
813 and 707 bp, which we have designated MDM2-al and 
-g (Fig. 1). Sequencing shows that these two transcripts 
have interna! seqi n 1 t , * 1 ' j> lacks nucleo- 
tides from codoos 28 to 222 and eodons 275 to 300; 
MBM2-g misses nucleotide codons from 28 to 91% and 
iS4 J / 3 to 300. Fig. 2 shows the structure of these tran- 
scripts in relation to the lull-length MDMZ cDNA se- 
quence and pt is •• 1 «:>mnt transcripts {see 
GenBaok accession numbers AF201370 and AP20S37S for 
details of She sequences). 



0,5 fcb — 




I i MOM} rajssenpts -wste ia tub ^IPl" 
frsitiarik'JjsAJi^'. ! ; • scribed (Sigidas et at, 

i , t , [ - t ' < t h mite. 

3.2, MDM2 genomic structure and organisation 

We have used long-range PCR amplification, followed by 
clotting and sequencing to investigate the organisation of the 
human MDM2 gene and in particular to define intron - exon 
boundaries and flanking introntc sequences. Eleven DNA 
fragments were amplified from genomic DNA with the 
primer pairs, shown in Tabie ! , which match to the known 
v . " ■ jqusnces. Comparison of the sequences of 
these PCR products with the published sequences of MDM2 
cDNA clones reveals that the Pril primer pair spans irttrons I 
and 2; while primer pairs Pri2 - 1 0 cover one intron per primer 
pair. However, the region flanked by primer pair Prill, 
covering from the 1235th to 235 1st nucleotide of the 
MDMZ cDNA clone sequence (Oimer el si., 1992), was 
found not to contain any miromc $• x^en:-.- Sequence ital- 
ysis indicates that MDM2 spans approximately 33 kb of 
genomic DMA and is separated by 11 introriS. Exons range 
in size from 50 to k. 1 16 ! hp. The size of the irttrons varies 
from 121 to - 7000 bp (Table 2). Exon-irttron boundary 
sequences of the 5' and 3' splice sites follow the "GT aod 
AG" rule (Table 3; see GenBank accession numbers 
AP144014 AF 544033 for additional mrronsc sequence data). 
Table -1 shows the 3' ends of the introme sequences adjacent 
to the intron-exon boundaries, including branch sites and 
polypyrimidine tracts. The sequences of branch sites have a 
good match with the consensus sequence YCRA*Y (Y: 
pyi ; ildine; R: purine, A*; branch point residue). The dis- 
tances between the branch points and the 3' splice sites vary 
from IS to ! 11 bp. The GT content in the polypyrimidine 
tracts ranges from 53% to 90%. 

Comparison of the structure and organisation of the human 

' i fj? gene described here with that published for the mouse 
gene (Jones et a!., 1996) indicates that the number of the 
exons and nitrons is the same. The size of the coding exons is 
similar (Table 2). However, the sizes of the noucoding trams- 
cril d region including exons 1, 2 and 12 and tire nitrons, 
differ substantially, with the exception of introns I and 3. 
3 3. Analysis of alternatively spiked variants of MOW 
mRNA 

We have detected seven MDM2 transcript variants 
(MDM2-a, ~al, A -c, -d, -e and ~g) previously and 
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presently (Sigalas ct al., 1996), Some of these variant 
transcripts have also been demonstrated in astrocytic 
s by another group (Matsumoto et a!., 1998). 
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Jbe alignment of Uie genomic sequence with the sequen- 
ces of tlrese seven transcripts discloses that ihe muliiple- 
sized transcripts resulted fins the deletion of multiple 
entire exons or part of oxen sequences (Fig. 2). MDM2-a 
■^kips exons 4 to 9; MDM2-aL exons 4 to 9 and exon 
! f,%/7-f>. «vons 4 to 11; MDM2-e, exons S to 9; 
MDM2-d exons 5 to li, most of exon 4 and the 5> end 
♦ <on 2 Mi. Ml e. exons 6 to 11, the 3' end of exon 
s ,nd the 5 : end of exon 12. M>,V2-£ splices out c.xons 
4 to 5 and exons 7 to 11 (Fig. 2) Among the sbpped 
exons exons 4 to 9 and exon 11 were most frequently 
rn sed fhese commonly exchjd 1 <oi *'«th the t< 
rrotion of exon 7) have either less than 60% of OI 
» , ream poly ri tnd t 1 •> 1 ' 
,nore than 40 bp between their upstream branch points 
and 3' sites. Whereas, exons 3, 10 and 12, which :i re 
most frcqaently included m the iranscnj a v sr 
than 60% CT content within their upstream poly;: > 
dine tracts and loss than 40 bp between then upstream 
branch points and 3' splice sites. Since exon 6 contains 
interrupted codons at bom ends, the MDM2-g variant, 
wMch splices out severs! exons tip- and downstream of 
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GGAdCfiJSgtgctggc - -iutroa 1— tetcccagCTtiTSTT 
GATCCftGgtaageac-- iatxon 2- -ccttgtsgGCAft|3Sj 

r. xccci<53t c at?t at- - intrem 3 - - tctcatagSSTAGAC 
27 28 

GAMSSSstasgcKq- -inwon 4—tattt.cagSSXCTTT 
52 53 

AGOSAGgtaattct" iatron 5~ -tetaeaagS&AAftTA 

CAGCftSGgtaaettea-- tntron 6-'tetc£cagftA£S£TC 

513 115 
TOW&^gt«atcta- -Introo 7~ -aegcttagOasefTG 

GAgACaegcat.af.at-- inteon B--atatcsagAAGSA&A 
J67 



TGATCftGgt^: at.al • ■ ir.tron 10 -t-.etattsgGJ&TA'-'c 
274 «5 

cr» Dtaa-ttar >ntron ii-caetga»g§&£TJOT 
300 301 



s. Underlined B uc:e«;<fcs eneodc anano acids (numbered 



excn 6, has a shifted reading frame after codon 28. The - 4. Discussion 
MDhil-e variant, which also iavoives a detetiem junction 

w .th an intemmted codon sequence, has a shift m the in our previous studies and the results presented here, 

retina frame after codon 484 detected multiple-sized MDM2 transcripts » humaa 
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tumour tissues, but not m nom..i! tissues Detection of 
i MDM2 mPNAs varied, and 
appeared to be relatively abundant in most samples. 
These variants encoded protein products in vitro and 
were found to transform NIH3T3 cells and to be associ- 
ated wit}} high-grade and late-stage human cancer (Sigaias 
el a!., 5996 and unpublished data). Our data were 
supported by the observations of other groups, who 
reported thai" alternatively spliced mdm2 transcripts pro- 
moted tumour formation in mouse model (Fridman et al., 
2 - Stciinntm et at 2004} Si se collective jbserva 
{ions suggest that alternatively spliced forms of MDM2, 
, , 'its r test! Atth differing fundi ial 
capabilities, may play an important role in tumour devel- 
opment The human genomic map presented hers 
enable us to relate the genomic structure and organisation 
of MDM? to the appearance- of variant transcript forms 
and pro-ride a basis for considering alternative splicing 
mechanisms. 

Pre-mRNA sph tig trv w )) ci e excision of intron 
sequences and the ligation of exon sequences, lrt cases ot 
alternative splicing, the excision may occur at cryptic splice 
sites; exons may be skipped and introns may be retained 
of the cxtm intron boundaries or the 
t v, ' ene indicates that the MDM2~a, -al, 4>> -e 
and -g vatiam transcript fotms {Fig. 2) result from multiple 
entire exon skipping, because the internal deletion junc- 
tions correspond exactly to the location of exon-tntron 
boundaries. However, the deletion junctions of MDM2~d 
and -e forms do not correspond to the boundaries between 
exons and introns, and there are no consensus sphctng 
sequences surrounding them to indicate the possible use of 
cryptic splice sites, This suggests that they may have 
resulted from an unusual and possibly aberrant splicing 
mechanism. 

The regulation of alternative splicing involves both c£s 
elements and frow-acting factors. The cts elements in- 
clude the 5' and 3' splice sites, a branch site and a 
polypynrnidme tract between the branch point and the 3' 
splice she It has been demonstrated that a short distance 
between the branch point and the 3' splice sue and high 
C/T content at poiypyrimtdme tracts give rise to high 
, , ■ ,-in-m et al . 19SS; 

Libri « al, 1989). bur data show that all the 5' and 3' 
splice sites of the human MDM2 gene obey the "GT" 
and "AG" rule. The branch sites also have a good match 
with the consensus sequence. However, the distances 
between branch points and 3' sites, and the percentage 
C/T content vary between introns. The exons most 
frequently retained in the splice variants have shorter 
distances between their upstream branch points and 3' 
splice sites and/or a higher percentage C/T content in 
their upstream polypyritnidine tracts, compared with those 
exons commonly excluded (Table 2). U suggests that the 
short distance between the branch point to the 3' site and 
the poSypyrimiduic tract with high C/T content confer 



high splicing efficiency to thetr adjacent splicing sites, 
which is consistent with the observation reported by 
Smith d al. (1989), Goux-Pefiefan et a!. (1990), Mueller 
st al. {1947). However, exon 7 was spliced out with a 
high frequency although there is only a short distance 
between its branch point and 3' splice site and there is 
also a high C/T content in its upstream poJypyrimidine 
tract The explanation may be that it is spliced out along 
with upstream exons The cider of mtron remove 5 is 
governed by preferential binding of splice factors rather 
than in a sequential numerical order {Lewin, 1994} It 
- be possibk that e»on 7 processes splicing before 
exon 6 does, and the S' end of exon 7 may in some 
circumstances be hgated preferentially » the 3' end of 
exon 6. If exon 6 has tow splice strength, then in the 
process of ligation with its upstream exon, it may be out- 
competed and skipped out together with exon 7. 

Currently, we are not clear why die alternative spliced 
transcripts appear preferentially in tumour samples, espe- 
cially in advanced stage and high grade, but not in normal 
tissues (Sigaias et a!., 1996; Battel et al., 2002). Although 
variant MDM2 spliced transcripts have been reported in 
normal tissues in one study (Battel d al, 2004), we failed to 
delect these isoforms in noncancerous tissues. It has been 
proposed that a mRNA surveillance system exists in cells, 
whiel protects them from errors of transcription, mRNA 
processing, or mRNA transport (Pulak and Anderson, 
1<>93). Mistakes are not uncommon in splicing of RNA 
from'complex genes Exons can oecasii ' ' 
(Migro et ai., 1991). In the norma! situation, the surveillance 
system would probably degrade < ml:\ i > 1 l < phcing 
errors as they are transported to the cytopU: 1 M . peci t. 1 ; 

-n may not function et 
and, const icntly, tl ' e variants may escape degrada- 
tion [ t is also p > hat here in nutat n i--.il.' i ton 
region that cause alien a f ing and tl enceofthe 
n< f It would be of interest to 
test this hypothesis by investigating in ton nucleotide 
e nces in tumour samples that show expression of 
3.it> i - i 1 forms. However, other models can be 
envisaged; "for instance, we cannot role out the possibility 
tbas trans-acting factors are involved in the alternative RNA 
processing by blocking some splice sites and'or enhancing 
other splice sites. 

In conclusion, we have detected two novel MD.M2 
alternatively spliced transcripts and have also defined the 
structure and cm ganisatton of the human MDKfl gene. It) 
addition we have related this information to potential 
mechanisms by which alternatively sized MDM2 transcripts 
are generated. As the alternatively spliced MDM2 mRNAs 
have been shown to possess oncogenic potential and to 
correlate with advanced malignancies to tumour progress 
• Haines et ;tL 1994; Sigaias et al„ 1996; Steinman « ai, in 
press), our data may is * n ' aidia . o ; sarcomas 
displaying MDM2 amplification and alternative splicing of 
MDM2 transcripts. 
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Is "]unk" DNA Mostly Intron DNA? 

Gane Ka-Shu Wong/' 3 Dougias A. Passey, 1 Ying-zong Huang, 1 Zhiyong Yang, 1 and 
JunYu 1 - 2 
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Genome Centei, institute ot Genetics. Chinese Academy of Sciences, Setting, China 

Among higher eukaryotes, very lit tie of the genome codes for protein. What is in the res! of the genome, or 
ilc" Dl i- i to I fmost o the enome? I it possible that much of 

this "jtink" h intron ONA? Thh is not a question tii.it on be answered just by looking .it the published data, 
even from th . t ume rat thert are no gene* in a st ), just because 

no genes were annotated. We introduce another approach to this problem, based on an analysis of the 
cDNA-to-genomic alignments, in at! of the complete oi nea . genomes from the multicellular 

oi misms. 1 >y onclu iot is tn.it, m animals but not in plants, most of the "junk" Is intron DNA. 



Among higher eukaryoies, very Httte of the genome 
codes for protein What ss in the rest of the genome, or 
the "junk" DNA, tot, in //-/mo srmiens, Is estimated to 
bo almost 97% of the genome? if a region is gene-poor, 
is that because there ate vast desert* of intergenic DNA 
between adjacent genes, or is that because the few 
genes thai ate hsete are huge, with enormous in; coos? 

First a tew deimitions are needed. We consider 
only the euc inornate: portion of the* genome. The het- 
erochromatic portion ■e.g.. centromeres and toko 
m< s i.egeiy devoid of genes 

It is extremely di.fhcuft to clone, extremely polymor- 
phic, and e.nbkeiv to be sequenced cortectiy anytime 
soon. We define the exerts, and nitrons as •'intragenic 1 ' 
and -s ything el s ot n Fins is not t m 
ply thai imergenic DNA is nonfunctional, especially as 
we have incorporated the promoters una our detlni- 
tion. However, promoters arc difficult to identify, 
whereas exons and hurons are reliably Identified by 
cONA-kegenomic alignments. Lastly, eve will use the 
term "genomic length"" to indicate the stun of the ex- 
ons and Inrrons in a given gene and "< ONA length" to 
indicate the stmt of only the exons. 

been after a genome is completely sequenced, it is 
not a straightforward mattes to determine the inter- 
genic fraction. Indeed, any assessment that is based 
only on the fraction of the genome that has not been 
identified by the gene annotations is likely to be an 
overestimate of the underlying reality. Consider bow 
I'll solo "i > > c \, s i '\cdiites tf he 

Caett'ffhabdith ek-pws Sequencing Consortium 1998; 
Uunhumetfi i*»i hru'e! i^uo \i, %vf( >uii jogo. 
Adams ci al. 2000; Hattori et ;sb 2000. employ a com- 
bination of ICT/cDNA/protein alignments and ub ini- 

3 Ccnrrespcn»dteg author, 

SIWAfL gksw@ti.waASngtw.edw,- FAX (206) 685-7344. 

1 t 1 , r t i s " 1 

gr.l 48900. 



tto exon-ptedietioti programs. Given the incomplete 
stmtthciS vD ipi.km m iu< st ,J th. mm 

programs, even if parts of ccttat gi m s ,.ir i n t | 
by the expeiimem.fi dan. f bete nee two problems (Bnc- 
setandGmgo )- • ■ ^ > < < »te ts that the 

exon-pjedictK n j i .n m < n t thntdy unttans- 
lated non-coding exons (i.e., the f'TRsy The second, 
more important, issue is ■hat these programs are not 
pamuilaih j ) en ni j fie \ i ■ • lb i 
are three reasons: CI) The signal-f.o-noise ratio can be as 
low ;is iilOm. for the extreme ease of a 100-bpexon 
juxtaposed next to a 1004cb intron: (2) the data sets 
used to trait) these programs tend to uoder-Kpresent 
the large difficult-to- sequence genes; and ■ the 
codon-nsage statistics., by which the exons arc initially 
Identified, are rue as informative- lor the large genes of 
certain organisms (Wright 1990). 

The extent of the large-genes problem is organism 
dependent. The determinant is the distribution of ge- 
nomic lengths, U the genomic lengths are distributed 
over many orders of magnitude, then failure to anno- 
tate even a small fraction of the largest genes will leave 
a much larger fraction ot the genome unarmotated. In 
this scenario, there is a critical difference between the 
■oilovving no) seemingly similar quantities: the frac- 
tion oi the; genes in the: genome that is correctly iden- 
tified and the traction of the genome sequence that is 
labeled as intragenic. The first quantity is far more 
likely to be correct than the second. H is possible that 
the total gene count is essentially cot reet, white, at the 
same time, the intragenic fraction Is significantly un- 
detestimated and the Intergeuie fraction is signifi- 
cantly overestimated, indeed, this is precisely the prob- 
lem lot the animal genomes. 

Our solution is to determine the distribution of 
genomic lengths entirely from <:D NA-to-g«nomk: 
alignments (i.e.. independent of the exon-predSctiort 
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programs). Then, compare the mean genomic length 
to the mean gene-to-gene distance. The former is taken 
from the cDNA alignments, but the tatter is computed 
as the ratio of the enehromatic genome size, divided by 
the gene count, taken from the annotations. Reliable 
results am csptmtd for PmM,phi!a >>, , i , i^.-V and 
Cdt-tiorhabJith (aeemo. In im genome ([iKiKim, ioi 
these organisms is complete and estimates of the gene- 
to-gene distan > vaitabl bud in/rots thulkma 
the published chromosomes (Lin c;t ai. 1999; Mayer et 
a 9 l u, >«t agree to 4 V>. , so vvc i an safe h i j trajx iatt to 
the entire genome. In contrast, for II. sapiens, the pub- 
lished chromosomes (Dunham et a[. 1909; Matron e!: 
ai. 2000 > differ lay 243%, reflecting the heterogeneity 
in the gene densities of warm-blooded vei tebrates (Ber- 
nard! 2000). Coupled with the difficulties ot deienmn- 
ing the mean genomic length, a result of the lack of 



large genomic contigs, ere reler extensively to the 
model organism results to guide our interpretations of 

RESULTS 

Figure I depicts i he distribution of genomic lengths lor 
H. sapiens. A mu^i'm^it r, < ..rVy<m,., and !. t)>alhm,i 

I hie is i tun lerical sui m v. The animal di i 
tions spars 2-3 oiders of magnitude, hut the plant dis- 
tribution spans only one order of magnitude, the im- 
plication for td 
in consider* ,„ 1 s vs i 

have to he unidentified for halt of she intragenic space 

< fit missing. Tim figures n t n i 1 < and S.0% it 
one extrt mo, , • < t, i " . s <.hi u iu 
at the other extreme, in .4. thalUma. Furthermore, the 
only organism in which the inf.ergenk: fraction is 
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figure 1 i I f i < i > r i 

.a-ai):*>f)5/s thahann. Dark stwii:io indicates sOorui airs. Weal; hits (liqliHy slaiomm rf:r:ue-^uo cte.iA -!'■><;•: nun tic atignmetiU with s.S < : m>iv 
or <50% of tht- cDNA length .liioncd An overvtfk-lmiricj majority of these weak hits ai« actually complete- .diqrttmaas win o»iy one u 
two exoru. instances in which <50% of ti t is aikiued represent / .3%, 3.3%, 1 .2%, rind 0 ' of i yj^uev in riie f;>;.:r oroariisnis 
respectively. 
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greater than 10% is ,4. 'Jwiiana, even though we have 
inducted the minor correction tor nested genes (genes 
on the reverse strand or inside an nitron). This correc- 
tion is computed by counting the occurrences or' 
nested genes in our cDNA alignments, and adjusting 
for the fact, thai: we do not detect every such occurrence 
because we do not have ai! of the cDNAs. 

The main uncertainty in our method is that we 
must extrapolate from a subset of the genes to the en- 
tire genome to determine the mean genomic length. 
There will he sampling biases, but they can be catego- 
rized and suixategoriz.ed as follows: (it the extent to 
which cDNA data ate enriched for huge ot small genes.. 
(2) the extent to which genomic data are biased for 
large or stnal! genes, am! then, are the gene-rich re- 
gions done tns; by scout tving ptoiects? Ate the cor- 
tigs large enough for us to align the large genes? 

We will argue that the problem is primarily in the 
genomic data, not the cl>NA data, Furthermore to the 
extent that there are sampling biases, the tendencies 
are always to underestimate the mean genomic length 
and to overestimate the intergenk. fraction. 

There are two reasons to suspect that biases in the 
cDNA data will cause us to underestimate the mean 
genomic length. Keep in tntnd that large gene- ace 
highly correlated with laige cDNA? (this pajwr; data 
not shown). The first explanation is that full-length 
cDNAs are exttemely difficult to clone, given the ease 
with which RNA molecules are degraded and the in- 
trinsic bias in the cloning system for smaller inserts. 



The second reason is that large 8NA molecules require 
more time to transcribe, so large genes might he less 
highly expressed and more difficult to isolate. How- 
ever, this expectation is incorrect, ix-c.ause the tran- 

ot the expression levels, in it $,iph-ns. we aligned the 
l,as<t,H')2 ESTs in GenBank against our c.DNA data. 
Multiple reads front the same clone were counted only 
once, figure 2 shows that there is no significant varia- 
tion in EST coverage as a function of genomic length. 
Notice that the normalization procedures (Hillie.c et al. 
1996) applied to the fiST libraries do not affect the rare 
trasrscripis, in whicl) we were looking for an effect. The 
conclusion is that cDNA data, extracted from Gen- 
Bank, can Ik? representative of ah genomic: lengths. 

Genomic data are biased in twowavs. Hrst, there is 
a sociologic bias toward sequencing gene-rich regions 
tirst. >-Xc :vh even whern a gc;nome is complete.. lack of 
long-range contiguity, on the; scale of the largest genes, 
will reduce the estimate of the mean genomic length, 
because any breaks in the alignment are most hkely to 
occur across the largest introm. Both issues are rel- 
evant in the H. h</»V/iv data, in figure \ we demon- 
strate that the aligned data are biased toward tit;.'.- rich 
genes, which are of smaller genomic lengths (Bernard) 
2(100). As for contiguity, we estimate the extent of the 
problem by computing the ratio of the median ge- 
nomic eotitig sue to the genomic length of the 95th 
[x-fcentiie gene. Ideally, this ratio would he much 
greater than one. Table i shows that it is much greater 
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done are counter! only once- "there- is no obvious bias, indicating that cDNAs for 
genes of every genomic length are equally easy to isolate. 



obllli 



). idea 



than one in D, mvhmo^ma and .-i. b.shno.w. ft is only 
moderately greater than ok: in C. eiegam, but that is 
less important for this organism, bee 



adly distribute 
si. 2, a< 



, the i 



leneghs 
Hoi' 

it would have been even smaller had ,-.<• 
not used genomic data from a new division 
of GenSank in which alt of the overlapping 
Clones have been joined together (Jang et 
a!. 1999). 



ran we interpret the H. sapiens data? If we 
accept the traditional gene count of 70,000, 
our mean genomic length erf 43.4 kb pre- 
dict;: an intergenk traction of 10%. Sup- 
pose we inflate our estimate by "tie same 
34% discrepance that was observed between the twoi). 
tnelanogusler data sets. The- gene count that would be 
consistent with the same hid;, mrergeoie fraction is 



Number of cDNA: fraction aligned - 0.146 

i H Strong hit j 
Weak hits ■ 



We can estimate the severity of these 
biases with the ditfercm versions of the D. 

„ ;--,„s,'t i .i m dtta ,t, i,„b w 
repeated the alignments with, the same 
c DNA data but switched to the 34.9 Mb of 
finished elone-by-cfone. genomic data that: 
was available prior to the completion of the 
whole-genome shotgun t Adams et al. 
2000). The contig quality measure is then 
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■sulfa 

off ti 



? mark by 34% 
s aligned in t 



'oth 

data sets, we iind that 16% of this effect is 
attributable to the contiguity problem. The 
other 18% is att ributable to the bias toward 
sequencing gene-rich regions first. An even 
mote dramatic example of these biases is 
Mm musculus, which has a contig quality 
measure of 0.3 arid a mean genomic length 



025 0.5 0.75 
cDNA's GC 



i> it 

pitted the n ,t list teat IO a d particular GC content aligned to genomic 
seqeoce, given that only 360 Mb of nenredu ndata. fifiooed tienorok seqoence 
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GC(mean) « 0,360, GC(median) s 0.366 
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GC(mean) * 0.359, GC(median) ■ 0.362 
1200- 




of 35,000 to 45,000 for 
much imeigenle DNA 



at Midi a i, _ l i tuMni- 

boo of GC content for genomic DNA is bimodat, 
as in Figure 4 Fitting the data to a mm of Gaus- 
sian* reveals that the malt) mode is centered at 
0382, which ts almost identical to the 0390 GO 
content of the aligned A. thalimui genes. The rela- 
tive tttio ul tb<-te.t, mcuiesinipiksan hiiotoik 
fraction of 30%. winch o smaller than the 46% 
estimate derived from gemwti« length argu- 
ments but not une\\;?eetedly so, because some of 
■he iniergenic DNA could have a GC content Oast 
is simllat to the mtt t^cmc DNA, the reason whv 
toisbimooality has trot been reported pievionsly 
is that it ts extremely sensitive to how the data 
are plotted. SpeeiiNaLlv, the histogram bins most 
be smaller than the mean genome: length, and 
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■v the data are plotted. 



;>me annotation efforts eo 
id) intergetiic- DNA? One 
•.s features of the recent a 



GC(mean) ■ 0,368, GC(median) - 0.374 




than intragenic DNA, iuqee 
biwodal However, the? "bin 
tr.vv tin; data aro plotted. ■: 
bins over which the CC co; 
lib, ^ipyrtivdv. Bins:,-fo;ar 
oi 2.6 kb obscure the- effec 
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is "junk" DNA Mostly Inlroi) DNA? 
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In all likelihood, 
large genes, with 



nptedicttot 



with absolutely 
each of these re< 
no counterpart in the BT 
which are not being detects: 
programs. After accounting f< 
tier of the presently unannotated regions, will likely 
attributed to untranslated non-coding exons ai 
flanking i.nirons We mxm teitetate that the fraction 
the genes chat h. missing docs not have to be large 
c | 1 hn tit f 1 reg 
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fraction or the precise gene count but, at the risk of 
extrapolating from a limited number of genomes, the 
difiefcnccs between plants and animals. 'There is evi- 
dence that plant and animal genomes are organized in 
different ways, in H. sapU-m. large genes, are caused by 
a combination of large natrons and more trnrons per 
gene (this paper; dart! not shown). At least .S5.4%or'fhe 
total length of the inttons in our/-/, sn/kens data ts doe 
to interspersed repeats (e.g., Ahi and ,', ? i. ! he true frae- 



oldcr , 



ecp.ei 



e l( ie 



serisns, cannot be Iderilihed by existing methods (Smil 
1996). Analysis of otlhologons genes in Fugu mbripes 
and H. sapiens teveais that touch of the 10- fold differ- 
ence in lite sizes of these two genomes cat! he ex- 
plained by differences in intron sizes (Hgaretal, loogi. 
III contrast, analysis of syntenic loci among grasses re- 
veals that much of the 40-fold difference in the ska ot 
these genomes ran be explained by their extensive re- 
peat-filled miergeote region;. [SanMigoei ct al. }9n6. 
Bennetzen et ai,' l998). 

The; conclusion is that, in animals, most repeats 
integrate into intron DNA, but, in plants, most repeats 
integrate into intergenk DNA. h there ding dif- 

ferent about the nature of the repeats that insert into 
animals and plants? Does this dichotomy reflect differ 
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seeaieisee, ate: intention is it, i-nstu-e mat, if anything, ere 
i;;ede.:es;.iaaae: the mean gnomic Svngrtt. 
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